(54) Human G-protein-coupled receptor

(57) Polynucleotide and polypeptide sequences are
described. The polypeptide sequences comprise one or
more of (a) a polypeptide having the deduced amino
acid sequence translated from the polynucleotide
sequence in SEQ ID NO: 1 and variants, fragments,
homologues, analogues and derivatives thereof; (b) a
polypeptide of SEQ ID NO: 2 and variants, fragments,
homologues, analogues and derivatives thereof; or (c)
a polypeptide encoded by the cDNA of NCIMB 41065
and variants, fragments, homologues, analogues and
derivatives thereof.
Description

Technical field

[0001] The present invention relates to a novel polynucleotide sequence which encodes a novel polypeptide belonging
to the class of proteins known as G-protein coupled receptors (GPCRs). The present invention also relates, inter
alia, to processes for producing the polypeptide and its uses.



Background of the invention 

[0002] Cells and tissues respond to a wide variety of extracellular signalling molecules through the interaction of
these molecules with specific cell-surface receptors. One such class of receptors is known as G-protein coupled
receptors (GPCRs) and these are characterised by containing a series of 7 hydrophobic transmembrane segments.
Upon binding of an extracellular ligand to its receptor, intracellular signals are initiated via interactions with heterotrimeric
G proteins which, in turn, can lead to a number of different intracellular events depending upon which receptor has
been activated. For example, some GPCRs influence adenyl cyclase activity whereas others act via phospholipase C.
[0003] Members of the GPCR superfamily respond to a wide variety of ligands including small molecule amines
(such as serotonin, dopamine, acetylcholine), lipid-derived mediators (such as LPA), amino acid derivatives (such as
glutamate) and neurotransmitter peptides and hormones (such as neurokinin, galanin, glucagon, gastrin). Although
GPCRs are activated by a broad range of ligands, it should be noted that individual GPCRs have a small and very
specific repertoire of ligands. Based upon an analysis of the primary structure of a novel GPCR, it is now possible to
classify it into a specific sub-family, thereby narrowing the range of potential ligands.

[0004] In many cases, the endogenous ligands of GPCRs are relatively small, enabling them to be mimicked or
blocked by synthetic analogues. For example, drugs such as prazosin, doxazosin, cimetidine and ranitidine are all effective
antagonists of their respective target GPCRs.

[0005] Thus, as the modulation of GPCRs can have therapeutic consequences, there is a continued need to provide 
new GPCRs and their associated agonists and antagonists. 



Summary aspects of the invention 

[0006] In a broad aspect, the present invention relates to novel amino acid sequences. In this regard, a specific novel
amino acid sequence has been isolated and it is to be understood that the invention covers that sequence as well as
novel variants, fragments, derivatives and homologues thereof.

[0007] In another broad aspect, the present invention relates to novel nucleic acid sequences. In this regard, a specific
novel nucleic acid sequence has been isolated and it is to be understood that the invention covers that sequence as
well as novel variants, fragments, derivatives and homologues thereof.
[0008] Thus, in brief, some aspects of the present invention relate to: 



1. Novel amino acids.

2. Novel nucleotide sequences.

3. Assays using said novel sequences.

4. Compounds/compositions identified by use of said assays.

5. Expression systems comprising or expressing said novel sequences.

6. Methods of treatment based on said novel sequences.

7. Pharmaceutical compositions based on said novel sequences.

[0009] Other aspects concerning the amino acid sequence of the present invention and/or the nucleotide sequence
of the present invention include: a construct comprising or capable of expressing the sequences of the present
invention; a vector comprising or capable of expressing the sequences of the present invention; a plasmid comprising or
capable of expressing the sequences of the present invention; a cell transfected or virally-transduced with a construct/
vector/plasmid comprising or capable of expressing the sequences of the present invention; a tissue comprising or
capable of expressing the sequences of the present invention; an organ comprising or capable of expressing the
sequences of the present invention; a transformed host comprising or capable of expressing the sequences of the
present invention; and a transformed organism comprising or capable of expressing the sequences of the present
invention. The present invention also encompasses methods of expressing the same, such as expression in a
micro-organism, including methods for transferring the same.

[0010] For ease of reference, aspects of the present invention are now discussed under appropriate section headings.
However, the teachings under each section are not necessarily limited to each particular section.
[0011] In the following commentary, references to "nucleotide sequence of the present invention" and "amino acid
sequence of the present invention" refer respectively to any one or more of the nucleotide sequences presented or
discussed herein and to any one or more of the amino acid sequences presented or discussed herein. Also, and as
used herein, "amino acid sequence" refers to peptide or protein sequences and may refer to portions thereof. In addition,
the term "amino acid sequence of the present invention" is synonymous with the phrase "polypeptide sequence of the
present invention". Also, the term "nucleotide sequence of the present invention" is synonymous with the phrase
"polynucleotide sequence of the present invention".

Detailed aspects of the invention 

[0012] According to one aspect of the present invention, there is provided an isolated and/or purified polynucleotide 
comprising one or more of: 

(a) a polynucleotide encoding the polypeptide as set forth in SEQ ID NO: 2;

(b) a polynucleotide comprising a nucleotide sequence of SEQ ID NO: 1;

(c) a polynucleotide encoding the polypeptide expressed by the DNA contained in NCIMB 41065;

(d) a polynucleotide comprising a nucleotide sequence that has at least 70% identity to the polynucleotide of any
one of (a) to (c);

(e) a polynucleotide comprising a nucleotide sequence which is capable of hybridising to the polynucleotide of any
one of (a) to (d);

(f) a complement to the polynucleotide of any one of (a) to (e); or

(g) a polynucleotide fragment of the polynucleotide of any one of (a) to (f).

[0013] Preferably, the polynucleotide comprises a nucleotide sequence that has at least 75% identity to the
polynucleotide of any one of (a) to (c). More preferably, the polynucleotide comprises a nucleotide sequence that has at least
80% identity to the polynucleotide of any one of (a) to (c). Even more preferably, the polynucleotide comprises a
nucleotide sequence that has at least 85% identity to the polynucleotide of any one of (a) to (c). Yet more preferably, the
polynucleotide comprises a nucleotide sequence that has at least 90% identity to the polynucleotide of any one of (a)
to (c). More preferably, the polynucleotide comprises a nucleotide sequence that has at least 95% identity to the
polynucleotide of any one of (a) to (c). Most preferably, the polynucleotide comprises a nucleotide sequence that has at
least 98% identity to the polynucleotide of any one of (a) to (c).
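The graded identity thresholds recited above (70% up to 98%) can be illustrated with a minimal Python sketch. It assumes two sequences that have already been aligned to equal length, with "-" marking gaps, and it assumes that identity is counted over ungapped columns only; the text at this point does not fix the alignment method or the denominator, so both are illustrative choices. The function names are hypothetical.

```python
# Minimal sketch: percent identity between two pre-aligned sequences.
# Assumption: columns containing a gap ("-") in either sequence are ignored.

IDENTITY_TIERS = (70, 75, 80, 85, 90, 95, 98)  # thresholds recited in the text


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return percent identity over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def highest_tier_met(identity: float):
    """Return the highest recited threshold that the identity reaches, or None."""
    met = [tier for tier in IDENTITY_TIERS if identity >= tier]
    return max(met) if met else None
```

For example, two aligned 10-base sequences that differ at one position give 90% identity, which reaches the 90% tier but not the 95% tier.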

[0014] The polynucleotide described above preferably encodes a G-protein coupled receptor (GPCR). 
[0015] The present invention also provides a polynucleotide probe or primer comprising at least 15 contiguous nu- 
cleotides of the polynucleotide described above. 

[0016] The present invention yet further provides a vector comprising the polynucleotide described above.

[0017] According to a further aspect of the present invention, there is provided a host cell transformed or transfected
with the vector described above. Preferably, the host cell is a mammalian, insect, fungal, bacterial or yeast cell. 
[0018] According to a further aspect of the present invention, there is provided the transcribed RNA product of the 
polynucleotide described above. There is also provided an RNA molecule or a fragment thereof which is antisense in
relation to the RNA product and is capable of hybridising thereto.

[0019] There is yet further provided a ribozyme or zinc finger protein capable of binding to the polynucleotide
described above.

[0020] According to yet a further aspect of the present invention, there is provided a process for producing a polypeptide
or fragment thereof comprising culturing said host cell under conditions sufficient for the expression of said polypeptide
or fragment. Preferably, said polypeptide or fragment is expressed at the surface of said cell. The process preferably
further includes recovering the polypeptide or fragment from the culture.

[0021] There is also provided by the present invention a process for producing cells capable of expressing a polypeptide
or fragment thereof comprising transforming or transfecting cells with the vector described above.
[0022] According to a further embodiment of the present invention, there are provided cells produced by the process
described above. There is also provided a membrane preparation of said cells.

[0023] According to another aspect of the present invention, there is provided a polypeptide comprising: 

(a) a polypeptide having the deduced amino acid sequence translated from the polynucleotide sequence in SEQ
ID NO: 1 and variants, fragments, homologues, analogues and derivatives thereof;

(b) a polypeptide of SEQ ID NO: 2 and variants, fragments, homologues, analogues and derivatives thereof; or

(c) a polypeptide encoded by the cDNA of NCIMB 41065 and variants, fragments, homologues, analogues and
derivatives thereof.
[0024] There is also provided by the present invention an antibody against the polypeptide described above.
[0025] The present invention yet further provides a compound which modulates the polypeptide described above.
Preferably, the compound antagonises or selectively antagonises the polypeptide. Alternatively, the compound
agonises the polypeptide.

[0026] Also provided by the present invention is a pharmaceutical composition comprising the antibody or compound
described above and one or more pharmaceutically acceptable carriers, diluents, adjuvants or excipients.

[0027] According to another aspect of the present invention, there is provided a method for identifying a compound
which binds to and modulates the polypeptide described above, comprising contacting said polypeptide with a candidate
compound and determining whether modulation occurs.
[0028] Preferably, said method comprises:

(a) contacting a compound with cells expressing the polypeptide described above on their cell surface, said
polypeptide being associated with a second component capable of providing a detectable signal in response to
the binding of a compound to said polypeptide; said contacting being under conditions sufficient to permit binding
of compounds to the polypeptide; and

(b) identifying a compound capable of polypeptide binding by detecting the signal produced
by said second component.

[0029] Alternatively, said method comprises: 

(a) contacting (i) a detectable first component known to bind to the polypeptide described above and (ii) a
compound, with cells expressing the above polypeptide on their cell surface, said polypeptide being associated with
a second component capable of providing a detectable signal in response to the binding of a compound to said
polypeptide; said contacting being under conditions sufficient to permit binding of compounds to the polypeptide;
and

(b) determining whether the first component binds to the polypeptide by detecting the absence or otherwise of a
signal generated from the interaction of the first component with the polypeptide.

[0030] The compound identified by any of the above methods preferably binds to and (i) antagonises or selectively
antagonises the polypeptide described above, or (ii) agonises the polypeptide described above.

[0031] As GPCRs are involved in signal transduction, modulators (e.g. agonists or antagonists) of the polypeptide 
of the present invention can find use in interfering in the signal transduction process. 

[0032] Therefore, according to yet another embodiment of the present invention, there is provided the antibody, 
compound or composition described above for use as a pharmaceutical. 

[0033] Such antibodies, compounds and compositions, which can modulate the polypeptide of the present invention,
can therefore find use in the therapeutic areas which concern aspects of signal transduction. Therapeutically useful
areas include, but are not limited to, obesity, diabetes and metabolic disease, neurological disease, psychotherapeutics,
urogenital disease, reproduction and sexual medicine, inflammation, cancer, tissue repair, dermatology, skin
pigmentation, photoageing, frailty, osteoporosis, cardiovascular disease, gastrointestinal disease, anti-infection, allergy and
respiratory disease, sensory organ disorders, sleep disorders and hair loss.

[0034] Accordingly, there is also provided the use of the compound described above in the manufacture of a medi- 
cament for the treatment of a patient having need to modulate the polypeptide described above. Preferably, the treat- 
ment is for a patient having need to antagonise or selectively antagonise the polypeptide. Alternatively, the treatment 
is for a patient having need to agonise the polypeptide.

[0035] According to yet a further aspect of the invention, there is provided a method for the treatment of a patient
having need to modulate the polypeptide described above comprising administering to the patient a therapeutically 
effective amount of the above-described compound. Preferably, said method is for the treatment of a patient having 
need to antagonise or selectively antagonise the polypeptide. Alternatively, said method is for the treatment of a patient 
having need to agonise the polypeptide. 

[0036] Preferably, said compound is a polypeptide and a therapeutically effective amount of the compound is
administered by providing to the patient DNA encoding said compound and expressing said compound in vivo.
[0037] There is also provided by the present invention use of the antibody described above in the manufacture of a 
medicament for the treatment of a patient having need to modulate the polypeptide described above. Preferably, said 
method is for the treatment of a patient having need to antagonise or selectively antagonise the polypeptide.
Alternatively, said method is for the treatment of a patient having need to agonise the polypeptide.

[0038] Yet further provided by the present invention is a method for the treatment of a patient having need to modulate 
the polypeptide described above, comprising administering to the patient a therapeutically effective amount of the 
antibody described above. Preferably, said method is for the treatment of a patient having need to antagonise or
selectively antagonise the polypeptide. Alternatively, said method is for the treatment of a patient having need to agonise
the polypeptide.

[0039] According to yet a further aspect of the present invention, there are provided cells genetically engineered ex 
vivo or in vivo to express, overexpress, underexpress or to exhibit targeted insertion or deletion of the polypeptide of 
the present invention.

PFI-001 POLYPEPTIDE 

[0040] As explained above, the present invention relates to a novel GPCR - which has been internally designated
PFI-001 - and to a nucleotide sequence encoding same. The present invention also relates to the use of the novel
nucleic acid and amino acid sequences in the diagnosis and treatment of disease. The present invention also relates
to the use of the novel nucleic acid and amino acid sequences to evaluate and/or to screen for agents that can modulate
the GPCR. The present invention further relates to genetically engineered host cells that comprise or express the novel
nucleic acid and amino acid sequences to evaluate and/or to screen for agents that can modulate the GPCR.

[0041] The PFI-001 polypeptide may be the same as the naturally occurring form - for this aspect, preferably the
PFI-001 polypeptide is the non-native amino acid sequence (i.e. it is not present in its natural environment) - or is a
variant, homologue, fragment or derivative thereof. In addition, or in the alternative, the PFI-001 polypeptide is isolated
PFI-001 polypeptide and/or purified PFI-001 polypeptide. The PFI-001 polypeptide can be obtainable from or produced
by any suitable source, whether natural or not, or it may be synthetic, semi-synthetic or recombinant.

[0042] The PFI-001 coding sequence may be the same as the naturally occurring form - for this aspect, preferably
the PFI-001 coding sequence is the non-native nucleotide sequence (i.e. it is not present in its natural environment) -
or is a variant, homologue, fragment or derivative thereof. In addition, or in the alternative, the PFI-001 coding sequence
is an isolated PFI-001 coding sequence and/or a purified PFI-001 coding sequence. The PFI-001 coding sequence
can be obtainable from or produced by any suitable source, whether natural or not, or it may be synthetic, semi-synthetic
or recombinant.

PFI-001 POLYPEPTIDE AND SCREENING 

[0043] The PFI-001 polypeptide and/or its coding sequence and/or a sequence capable of hybridising thereto is/are
useful for testing the selectivity of drug candidates between different GPCRs.

[0044] It has been demonstrated (herein) that PFI-001 is most closely similar to neurotensin receptors and that PFI- 
001 encodes a novel GPCR whose ligand is likely to be a peptide. 

[0045] Drugs that modulate the novel PFI-001 receptor will therefore be likely to modulate signal transduction
processes. It is therefore likely that modulators of the PFI-001 receptor may be useful for the treatment of many different
disorders associated with signal transduction that will most likely include, but are not limited to, obesity, diabetes and
metabolic disease, neurological disease, psychotherapeutics, urogenital disease, reproduction and sexual medicine,
inflammation, cancer, tissue repair, dermatology, skin pigmentation, photoageing, frailty, osteoporosis, cardiovascular
disease, gastrointestinal disease, anti-infection, allergy and respiratory disease, sensory organ disorders, sleep
disorders and hair loss.

[0046] Thus, the PFI-001 polypeptide and/or its coding sequence and/or a sequence capable of hybridising thereto
may be useful for screening drug candidates for the treatment of diseases associated with signal transduction. In
addition, it is believed that PFI-001 polypeptide and/or its coding sequence and/or a sequence capable of hybridising
thereto may be useful for screening drug candidates for the treatment of diseases such as those described above.
[0047] Either or both of the nucleotide sequences coding for the PFI-001 polypeptide or the PFI-001 polypeptide
itself may be used to screen for agents that can affect GPCR activity. In particular, the nucleotide sequence coding for
the PFI-001 receptor itself may be used to screen for agents that can antagonise GPCR activity. In addition, the
nucleotide sequence coding for PFI-001 polypeptide or the PFI-001 polypeptide itself may be used to screen for agents
that selectively affect GPCR activity, such as selectively antagonise the PFI-001 receptor.

POLYPEPTIDE OF THE PRESENT INVENTION

[0048] The term "polypeptide" - which is interchangeable with the term "protein" - includes single-chain polypeptide
molecules as well as multiple-polypeptide complexes where individual constituent polypeptides are linked by covalent 
or non-covalent means. 

[0049] Preferably, the polypeptide of the present invention is a single-chain polypeptide.

[0050] Polypeptides of the present invention may be in a substantially isolated form. It will be understood that the
polypeptide may be mixed with carriers or diluents which will not interfere with the intended purpose of the polypeptide
and still be regarded as substantially isolated. A polypeptide of the present invention may also be in a substantially
purified form, in which case it will generally comprise the polypeptide in a preparation in which more than 90%, e.g.
95%, 98% or 99%, of the polypeptide in the preparation is a polypeptide of the present invention. Polypeptides of the
present invention may be modified, for example by the addition of histidine residues to assist their purification.
[0051] Polypeptides of the present invention may be produced by synthetic means (e.g. as described by Geysen et
al. 1996) or recombinantly, as described below.

[0052] In a preferred embodiment the amino acid sequence per se of the present invention does not cover the native
PFI-001 receptor according to the present invention when it is in its natural environment and when it has been expressed
by its native nucleotide coding sequence which is also in its natural environment and when that nucleotide sequence
is under the control of its native promoter which is also in its natural environment. For ease of reference, we have called
this preferred embodiment the "non-native amino acid sequence".

[0053] The terms "variant", "homologue", "fragment", "analogue" or "derivative" in relation to the amino acid sequence
for the polypeptide of the present invention include any substitution of, variation of, modification of, replacement of,
deletion of or addition of one (or more) amino acid from or to the sequence providing the resultant polypeptide has
GPCR activity, preferably being at least as biologically active as the polypeptide shown in attached SEQ ID NO: 2. In
particular, the term "homologue" covers homology with respect to structure and/or function. With respect to sequence
homology, there is at least 70%, preferably at least 75%, more preferably at least 80%, more preferably at least 85%,
more preferably at least 90%, more preferably at least 95% homology to the sequence shown in SEQ ID NO: 2. Most
preferably there is at least 98% homology to the sequence shown in SEQ ID NO: 2.

[0054] Typically, for the variant, homologue or fragment of the present invention, the types of amino acid substitutions
that could be made should maintain the hydrophobicity/hydrophilicity of the amino acid sequence. Amino acid
substitutions may be made, for example from 1, 2 or 3 to 10, 20 or 30 substitutions, provided that the modified sequence
retains the ability to act as a GPCR in accordance with the present invention. Amino acid substitutions may include
the use of non-naturally occurring analogues.
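One way to picture a substitution that "maintains the hydrophobicity/hydrophilicity" of a sequence is to compare residues on a published hydropathy scale. The sketch below uses the Kyte-Doolittle scale, which is a choice made here for illustration and is not named in the text; the tolerance of 1.0 scale units and the function names are likewise hypothetical assumptions.

```python
# Minimal sketch: is a single amino acid substitution roughly hydropathy-neutral?
# Assumptions: Kyte-Doolittle hydropathy values; tolerance chosen arbitrarily.

KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_shift(original: str, replacement: str) -> float:
    """Absolute change in hydropathy when one residue replaces another."""
    return abs(KYTE_DOOLITTLE[original] - KYTE_DOOLITTLE[replacement])


def preserves_hydropathy(original: str, replacement: str,
                         tolerance: float = 1.0) -> bool:
    """True if the substitution changes hydropathy by no more than `tolerance`."""
    return hydropathy_shift(original, replacement) <= tolerance
```

Under these assumptions a leucine-to-isoleucine exchange passes the check, while leucine-to-aspartate does not. Whether a given variant actually retains GPCR activity would still have to be established experimentally.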

[0055] The amino acid sequence of the present invention may be produced by expression of a nucleotide sequence
coding for same in a suitable expression system.

[0056] In addition, or in the alternative, the protein itself could be produced using chemical methods to synthesize a
PFI-001 polypeptide, in whole or in part. For example, peptides can be synthesized by solid phase techniques, cleaved
from the resin, and purified by preparative high performance liquid chromatography (e.g. Creighton (1983) Proteins:
Structures and Molecular Principles, WH Freeman and Co., New York, NY, USA). The composition of the synthetic
peptides may be confirmed by amino acid analysis or sequencing (e.g. the Edman degradation procedure).

[0057] Direct peptide synthesis can be performed using various solid-phase techniques (Roberge JY et al, Science
Vol 259 1995 202-204) and automated synthesis may be achieved, for example, using the ABI 431 A Peptide
Synthesizer (Perkin Elmer) in accordance with the instructions provided by the manufacturer. Additionally, the amino acid
sequence of PFI-001, or any part thereof, may be altered during direct synthesis and/or combined using chemical
methods with a sequence from other subunits, or any part thereof, to produce a variant polypeptide.

[0058] In another embodiment of the invention, a PFI-001 natural, modified or recombinant amino acid sequence
may be ligated to a heterologous sequence to encode a fusion protein. For example, for screening of libraries for
compounds and peptide agonists and antagonists of PFI-001 GPCR activity, it may be useful to encode a chimeric
PFI-001 protein expressing a heterologous epitope that is recognised by a commercially available antibody. A fusion
protein may also be engineered to contain a cleavage site located between a PFI-001 sequence and the heterologous
protein sequence, so that the PFI-001 may be cleaved and purified away from the heterologous moiety.
[0059] PFI-001 may also be expressed as a recombinant protein with one or more additional polypeptide domains
added to facilitate protein purification. Such purification facilitating domains include, but are not limited to, metal
chelating peptides such as histidine-tryptophan modules that allow purification on immobilised metals (Porath J, Protein Expr
Purif Vol 3 1992 p263-281), protein A domains that allow purification on immobilised immunoglobulin, and the domain
utilised in the FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA, USA). The inclusion of a
cleavable linker sequence such as Factor XA or enterokinase (Invitrogen, San Diego, CA, USA) between the purification
domain and PFI-001 is useful to facilitate purification.
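The layout of such a fusion (purification domain, cleavable linker, receptor sequence) can be sketched at the level of amino acid strings. Everything specific in the sketch is an assumption made for illustration: the six-histidine tag stands in for the purification domains listed above, the linker motifs are the commonly cited recognition sequences for Factor Xa (IEGR, cleaved after the arginine) and enterokinase (DDDDK, cleaved after the lysine), and the short "receptor" string in the usage note is a placeholder rather than any part of SEQ ID NO: 2.

```python
# Minimal sketch: assemble tag + cleavable linker + protein, then model cleavage.
# Assumptions: commonly cited protease recognition motifs; names are hypothetical.

CLEAVAGE_SITES = {
    "factor_xa": "IEGR",      # cleavage occurs after the final Arg
    "enterokinase": "DDDDK",  # cleavage occurs after the final Lys
}


def build_fusion(tag: str, site_name: str, protein: str) -> str:
    """Concatenate purification tag, protease site and protein sequence."""
    return tag + CLEAVAGE_SITES[site_name] + protein


def cleave(fusion: str, site_name: str):
    """Split the fusion after the first occurrence of the protease site.

    Returns (tag_plus_site, released_protein). A real protein could contain
    the motif internally; this sketch only considers the first occurrence.
    """
    site = CLEAVAGE_SITES[site_name]
    index = fusion.find(site)
    if index < 0:
        raise ValueError("cleavage site not found in fusion sequence")
    cut = index + len(site)
    return fusion[:cut], fusion[cut:]
```

With a placeholder protein "MKTLLV" and a "HHHHHH" tag, `build_fusion` followed by `cleave` returns the tag-plus-linker fragment and the untagged placeholder sequence, mirroring the purpose of placing the cleavable linker between the purification domain and PFI-001.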

[0060] A specific amino acid sequence of PFI-001 is shown in SEQ ID NO: 2. However, the present invention
encompasses amino acid sequences encoding other GPCRs which would include amino acid sequences having at least
70% identity (preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more preferably at
least 90%, more preferably at least 95%, most preferably at least 98% identity) to that specific amino acid sequence.

[0061] Polypeptides of the present invention also include fragments of the present amino acid sequence and variants
thereof. Suitable fragments will be at least 5, e.g. at least 10, 12, 15 or 20 amino acids in size.
[0062] Polypeptides of the present invention may also be modified to contain one or more (e.g. at least 2, 3, 5, or
10) substitutions, deletions or insertions, including conserved substitutions. These aspects are discussed in a later
section.
NUCLEOTIDE SEQUENCE OF THE PRESENT INVENTION

[0063] The term "nucleotide sequence" as used herein refers to an oligonucleotide sequence or polynucleotide
sequence, and variants, homologues, fragments, analogues and derivatives thereof (such as portions thereof). The
nucleotide sequence may be DNA or RNA which may be of genomic or synthetic or recombinant origin which may be
double-stranded or single-stranded whether representing the sense or antisense strand.
[0064] Preferably, the term "nucleotide sequence" means DNA. 

[0065] More preferably, the term "nucleotide sequence" means DNA prepared by use of recombinant DNA techniques
(i.e. recombinant DNA). 

[0066] In a preferred embodiment, the nucleotide sequence per se of the present invention does not cover the native
nucleotide coding sequence according to the present invention in its natural environment when it is under the control 
of its native promoter which is also in its natural environment. For ease of reference, we have called this preferred 
embodiment the "non-native nucleotide sequence". 

[0067] The nucleotide sequences of the present invention may include within them synthetic or modified nucleotides.
A number of different types of modification to oligonucleotides are known in the art. These include methylphosphonate
and phosphorothioate backbones, and addition of acridine or polylysine chains at the 3' and/or 5' ends of the molecule. For
the purposes of the present invention, it is to be understood that the nucleotide sequences described herein may be
modified by any method available in the art. Such modifications may be carried out in order to enhance the in vivo activity
or life span of nucleotide sequences of the present invention.

[0068] The present invention also encompasses nucleotide sequences that are complementary to the sequences
presented herein, or any variant, homologue, analogue, fragment or derivative thereof. If the sequence is
complementary to a fragment thereof then that sequence can be used as a probe to identify similar coding sequences in other
organisms, etc.
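As a small illustration of what a complementary sequence is, the following sketch computes the reverse complement of a DNA string using standard Watson-Crick pairing. The function name and the short example sequence in the note are hypothetical; the actual sequences of the invention are those of the sequence listing.

```python
# Minimal sketch: reverse complement of a DNA sequence (A<->T, C<->G).
# Only the four unambiguous bases are handled; other characters pass through.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(sequence: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return sequence.translate(_COMPLEMENT)[::-1]
```

For a hypothetical fragment "ATGCCG" the function returns "CGGCAT", and applying it twice gives back the original fragment.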

[0069] The present invention also encompasses nucleotide sequences that are capable of hybridising to the
sequences presented herein, or any variant, homologue, analogue, fragment or derivative thereof.

[0070] The present invention also encompasses nucleotide sequences that are capable of hybridising to the se- 
quences that are complementary to the sequences presented herein, or any variant, homologue, analogue, fragment 
or derivative thereof.

[0071] The term "variant" also encompasses sequences that are complementary to sequences that are capable of 
hybridising to the nucleotide sequences presented herein.

[0072] Preferably, the term "variant" encompasses sequences that are complementary to sequences that are capable 
of hybridising under stringent conditions (e.g. 65°C and 0.1xSSC {1xSSC = 0.15 M NaCl, 0.015 M Na3 citrate pH 7.0})
to the nucleotide sequences presented herein. 
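The salt concentrations implied by a given SSC dilution follow directly from the 1xSSC definition given above (0.15 M NaCl and 0.015 M Na3 citrate). The sketch below simply scales those two values; the function names are hypothetical, and the sodium total assumes that each Na3 citrate contributes three sodium ions.

```python
# Minimal sketch: molar composition of an n-fold SSC buffer.
# 1xSSC values are those stated in the text; names are illustrative.

SSC_1X_MOLAR = {"NaCl": 0.15, "Na3 citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Return component molarities for `fold` x SSC (e.g. fold=0.1 for 0.1xSSC)."""
    return {name: conc * fold for name, conc in SSC_1X_MOLAR.items()}


def sodium_ion_molarity(fold: float) -> float:
    """Total Na+ molarity: one Na+ per NaCl plus three per Na3 citrate."""
    parts = ssc_molarity(fold)
    return parts["NaCl"] + 3 * parts["Na3 citrate"]
```

For the stringent wash cited in the text, `ssc_molarity(0.1)` corresponds to about 0.015 M NaCl and 0.0015 M Na3 citrate, that is roughly 0.0195 M total sodium ion.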

[0073] The present invention also relates to nucleotide sequences that can hybridise to the nucleotide sequences 
of the present invention (including complementary sequences of those presented herein).

[0074] The present invention also relates to nucleotide sequences that are complementary to sequences that can 
hybridise to the nucleotide sequences of the present invention (including complementary sequences of those presented 
herein). 

[0075] Also included within the scope of the present invention are polynucleotide sequences that are capable of 
hybridising to the nucleotide sequences presented herein under conditions of intermediate to maximal stringency.
[0076] In a preferred aspect, the present invention covers nucleotide sequences that can hybridise to the nucleotide 
sequence of the present invention, or the complement thereof, under stringent conditions (e.g. 65°C and 0.1xSSC).
[0077] Exemplary nucleic acids can alternatively be characterised as those nucleotide sequences which encode a
PFI-001 protein and hybridise to the DNA sequence shown in SEQ ID NO: 1. Preferred are such sequences encoding
PFI-001 which hybridise under high-stringency conditions to the sequence shown in SEQ ID NO: 1 or the complement
thereof.

[0078] Advantageously, the invention provides nucleic acid sequences which are capable of hybridising, under
stringent conditions, to a fragment of the sequence shown in SEQ ID NO: 1 or the complement thereof. Preferably, the
fragment is between 15 and 50 bases in length. Advantageously, it is about 25 bases in length.
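The fragment lengths mentioned above (between 15 and 50 bases, with about 25 as the advantageous length) can be illustrated by enumerating every contiguous window of a chosen length from a sequence. This is only a sketch of the windowing step: the function name is hypothetical, and any input used with it here is a placeholder rather than SEQ ID NO: 1.

```python
# Minimal sketch: enumerate contiguous fragments of a given length.
# The 15-50 base bounds and the default of 25 come from the text.

MIN_FRAGMENT = 15
MAX_FRAGMENT = 50


def fragments(sequence: str, length: int = 25) -> list:
    """Return all contiguous fragments of `length` bases, in order of position."""
    if not MIN_FRAGMENT <= length <= MAX_FRAGMENT:
        raise ValueError("fragment length must be between 15 and 50 bases")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]
```

A 30-base input yields six overlapping 25-base fragments; an input shorter than the requested length yields none.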

[0079] The terms "variant", "homologue", "analogue", "derivative" or "fragment" in relation to the nucleotide sequence
coding for the preferred polypeptide of the present invention include any substitution of, variation of, modification of,
replacement of, deletion of or addition of one (or more) nucleic acid from or to the sequence providing the resultant
nucleotide sequence codes for or is capable of coding for a polypeptide having PFI-001 receptor activity, preferably
being at least as biologically active as the polypeptide encoded by the sequence shown in SEQ ID NO: 1. In particular,
the term "homologue" covers homology with respect to structure and/or function providing the resultant nucleotide
sequence codes for or is capable of coding for a polypeptide having activity as a PFI-001 GPCR. With respect to
sequence homology, preferably there is at least 70%, preferably at least 75%, more preferably at least 80%, more
preferably at least 85%, more preferably at least 90%, more preferably at least 95% homology to a nucleotide sequence
coding for the amino acid sequence shown in SEQ ID NO: 2. Most preferably there is at least 98% homology to a
nucleotide sequence coding for the amino acid sequence shown in SEQ ID NO: 2. With respect to sequence homology,
there is at least 70%, preferably at least 75%, more preferably at least 80%, more preferably at least 85%, more
preferably at least 90%, more preferably at least 95% homology to the nucleotide sequence shown in SEQ ID NO: 1.
Most preferably there is at least 98% homology to the nucleotide sequence shown in SEQ ID NO: 1.

[0080] As indicated, the present invention relates to a DNA sequence (preferably a cDNA sequence) encoding
PFI-001. In particular, the present invention relates to cDNA sequences encoding PFI-001.

[0081] The present invention also relates to DNA segments comprising the DNA sequence shown in SEQ ID NO: 1
or allelic variations thereof.

[0082] The present invention also relates to polypeptides produced by expression in a host cell into which has been
incorporated the foregoing DNA sequences or allelic variations thereof.

[0083] The present invention also provides DNA comprising the DNA sequence shown in SEQ ID NO: 1 or
allelic variations thereof.

[0084] The present invention also relates to non-native DNA comprising the DNA sequence shown in SEQ ID NO: 1
or allelic variations thereof.

[0085] A highly preferred aspect of the present invention relates to recombinant DNA comprising the DNA sequence
shown in SEQ ID NO: 1 or allelic variations thereof.

[0086] Polynucleotides of the present invention include nucleic acid sequences encoding the polypeptides of the
present invention. It will be appreciated that a range of different polynucleotides encode a given amino acid sequence
as a consequence of the degeneracy of the genetic code.
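The size of the "range of different polynucleotides" can be illustrated by counting synonymous codons. The sketch below assumes the standard genetic code; the names `CODON_TABLE`, `CODONS_PER_RESIDUE` and `count_encodings` are hypothetical and are not taken from the specification.

```python
from collections import Counter
from math import prod

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Number of codons that encode each residue ("*" marks a stop codon).
CODONS_PER_RESIDUE = Counter(CODON_TABLE.values())


def count_encodings(peptide):
    """Number of distinct coding sequences for a peptide (one-letter codes).

    Returns 0 if the peptide contains a letter that is not in the table.
    """
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


if __name__ == "__main__":
    for pep in ("MW", "LR", "MLSR"):
        print(pep, count_encodings(pep))
```

Even a short peptide rich in leucine, serine or arginine (six codons each) can be encoded by a large number of distinct nucleotide sequences, whereas methionine and tryptophan (one codon each) add no degeneracy.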

[0087] By knowledge of the amino acid sequences set out herein it is possible to devise partial and full-length nucleic
acid sequences such as cDNA and/or genomic clones that encode the polypeptides of the present invention. For
example, polynucleotides of the present invention may be obtained using degenerate polymerase chain reaction (PCR)
which will use primers designed to target sequences encoding the amino acid sequences presented herein. The primers
will typically contain multiple degenerate positions. However, to minimise degeneracy, sequences will be chosen that
encode regions of the amino acid sequences presented herein containing amino acids such as methionine which are
coded for by only one triplet. In addition, sequences will be chosen to take into account codon usage in the organism
whose nucleic acid is used as the template DNA for the PCR procedure. PCR will be used at stringency conditions
lower than those used for cloning sequences with single sequence (non-degenerate) primers against known sequences.
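The idea of choosing a region that minimises degeneracy can be sketched as a sliding-window search. This is an illustration under stated assumptions, not the procedure actually used: the codon counts are those of the standard genetic code, degeneracy is taken as the number of distinct coding sequences for the window, codon usage is ignored, and the function name and example peptide are hypothetical.

```python
from math import prod

# Codons per amino acid in the standard genetic code (one-letter codes).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def least_degenerate_window(peptide, width):
    """Return (start, degeneracy) of the window with the fewest encodings.

    Ties are resolved in favour of the earliest window.
    """
    if width < 1 or width > len(peptide):
        raise ValueError("width must be between 1 and the peptide length")
    best = None
    for start in range(len(peptide) - width + 1):
        window = peptide[start:start + width]
        degeneracy = prod(CODON_COUNTS[aa] for aa in window)
        if best is None or degeneracy < best[1]:
            best = (start, degeneracy)
    return best


if __name__ == "__main__":
    # Hypothetical peptide; a window of 3 residues corresponds to 9 bases.
    print(least_degenerate_window("LSRMWKDL", 3))
```

In the hypothetical peptide the window containing methionine and tryptophan is selected, consistent with the preference stated above for residues coded for by only one triplet.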

[0088] Nucleic acid sequences obtained by PCR that encode polypeptide fragments of the present invention may
then be used to obtain larger sequences using hybridisation library screening techniques. For example a PCR clone
may be labelled with radioactive atoms and used to screen a cDNA or genomic library from other species, preferably
other mammalian species. Hybridisation conditions will typically be conditions of medium to high stringency (for
example 0.03M sodium chloride and 0.03M sodium citrate at from about 50°C to about 60°C).

[0089] Degenerate nucleic acid probes encoding all or part of the amino acid sequence may also be used to probe 
cDNA and/or genomic libraries from other species, preferably other mammalian species. However, it is preferred to 
carry out PCR techniques initially to obtain a single sequence for use in further screening procedures. 

[0090] In accordance with the present invention, polynucleotide sequences which encode PFI-001, fragments of the
polypeptide, fusion proteins or functional equivalents thereof, may be used to generate recombinant DNA molecules
that direct the expression of PFI-001 in appropriate host cells. Due to the inherent degeneracy of the genetic code,
other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence, may be
used to clone and express PFI-001. As will be understood by those of skill in the art, it may be advantageous to produce
PFI-001-encoding nucleotide sequences possessing non-naturally occurring codons. Codons preferred by a particular
prokaryotic or eukaryotic host (Murray E et al (1989) Nuc Acids Res 17: 477-508) can be selected, for example, to
increase the rate of PFI-001 expression or to produce recombinant RNA transcripts having desirable properties, such
as a longer half-life, than transcripts produced from naturally occurring sequence.

[0091] Polynucleotide sequences of the present invention obtained using the techniques described above may be
used to obtain further homologous sequences and variants using the techniques described above. They may also be
modified for use in expressing the polypeptides of the present invention in a variety of host cell systems, for example
to optimise codon preferences for a particular host cell in which the polynucleotide sequences are being expressed.
Other sequence changes may be desired in order to introduce restriction enzyme recognition sites, or to alter the
property or function of the polypeptides encoded by the polynucleotides.

[0092] Altered PFI-001 polynucleotide sequences which may be used in accordance with the invention include
deletions, insertions or substitutions of different nucleotide residues resulting in a polynucleotide that encodes the same
or a functionally equivalent PFI-001. The protein may also have deletions, insertions or substitutions of amino acid
residues which produce a silent change and result in a functionally equivalent PFI-001. Deliberate amino acid
substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the
amphipathic nature of the residues as long as the biological activity of PFI-001 is retained. For example, negatively
charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine;
and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine,
valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine.

[0093] Included within the scope of the present invention are alleles of PFI-001. As used herein, an "allele" or "allelic
sequence" is an alternative form of PFI-001. Alleles result from a mutation, i.e. a change in the nucleic acid sequence,
and generally produce altered mRNAs or polypeptides whose structure or function may or may not be altered. Any
given gene may have none, one or many allelic forms. Common mutational changes which give rise to alleles are
generally ascribed to deletions, additions or substitutions of amino acids. Each of these types of changes may occur
alone, or in combination with the others, one or more times in a given sequence.

[0094] The nucleotide sequences of the present invention may be engineered in order to alter a PFI-001 coding
sequence for a variety of reasons, including but not limited to, alterations which modify the cloning, processing and/or
expression of the gene product. For example, mutations may be introduced using techniques which are well known in
the art, e.g. site-directed mutagenesis to insert new restriction sites, to alter glycosylation patterns or to change codon
preference.

[0095] Polynucleotides of the present invention may be used to produce a primer, e.g. a PCR primer, a primer for
an alternative amplification reaction, a probe e.g. labelled with a revealing label by conventional means using
radioactive or non-radioactive labels, or the polynucleotides may be cloned into vectors. Such primers, probes and other
fragments will be at least 15, preferably at least 20, for example at least 25, 30 or 40 nucleotides in length, and are
also encompassed by the term polynucleotides of the present invention as used herein.

[0096] Polynucleotides or primers of the present invention may carry a revealing label. Suitable labels include
radioisotopes such as 32P or 35S, enzyme labels, or other protein labels such as biotin. Such labels may be added to
polynucleotides or primers of the present invention and may be detected using techniques known in the art.

[0097] Polynucleotides such as a DNA polynucleotide and primers according to the present invention may be
produced recombinantly, synthetically, or by any means available to those of skill in the art. They may also be cloned by
standard techniques.

[0098] In general, primers will be produced by synthetic means, involving a step-wise manufacture of the desired
nucleic acid sequence one nucleotide at a time. Automated techniques for accomplishing this are readily available
in the art.

[0099] Longer polynucleotides will generally be produced using recombinant means, for example using PCR cloning
techniques. This will involve making a pair of primers (e.g. of about 15-30 nucleotides) to a region of the nucleotide
sequence which it is desired to clone, bringing the primers into contact with mRNA or cDNA obtained from a eukaryotic
or prokaryotic cell, performing a polymerase chain reaction under conditions which bring about amplification of the
desired region, isolating the amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and recovering
the amplified DNA. The primers may be designed to contain suitable restriction enzyme recognition sites so that the
amplified DNA can be cloned into a suitable cloning vector.

[0100] DNA molecules may be modified to increase intracellular stability and half-life. Possible modifications include,
but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends of the molecule or the use of
phosphorothioate or 2'-O-methyl rather than phosphodiester linkages within the backbone of the molecule.

[0101] As mentioned earlier, the present invention also relates to nucleotide sequences that are capable of hybridising
to all or part of the sequence shown in SEQ ID NO: 1 or an allelic variation thereof. These nucleotide sequences may
be used in antisense techniques to modify PFI-001 expression. Alternatively, these sequences (or portions thereof)
can be used as a probe, or for amplifying all or part of such sequence when used as a PCR primer.

[0102] In addition to the recombinant DNA sequences, genomic sequences are also of utility in the context of drug
discovery. It may be valuable to inhibit the mRNA transcription of a particular isoform rather than to inhibit its translated
protein. This may be true with PFI-001, if there are splice variants and wherein those different splice variants may be
transcribed from different promoters.

[0103] Another utility of the invention is that the DNA sequences, once known, give the information needed to design
assays to specifically detect isoforms or splice variants. Isoform-specific PCR primer pairs are but one example of an
assay that depends completely on the knowledge of the specific DNA sequence of the isoform or splice variant. Such
an assay allows detection of mRNA for the isoform to assess the tissue distribution and biological relevance of each
isoform to a particular disease state. It also allows identification of cell lines that may naturally express only one isoform
- a discovery that might obviate the need to express recombinant genes. If specific PFI-001 isoforms are shown to be
associated with a particular disease state, the invention would be valuable in the design of diagnostic assays to detect
the presence of isoform mRNA.

[0104] An abnormal level of nucleotide sequences encoding a PFI-001 receptor in a biological sample may reflect
a chromosomal aberration, such as a nucleic acid deletion or mutation. Accordingly, nucleotide sequences encoding
a PFI-001 receptor provide the basis for probes which can be used diagnostically to detect chromosomal aberrations
such as deletions, mutations or chromosomal translocations in the gene encoding PFI-001. PFI-001 gene expression
may be altered in such disease states or there may be a chromosomal aberration present in the region of the gene
encoding PFI-001.

[0105] In an alternative embodiment of the invention, the coding sequence of PFI-001 could be synthesized, in whole
or in part, using chemical methods well known in the art (see Caruthers MH et al (1980) Nuc Acids Res Symp Ser
215-23, Horn T et al (1980) Nuc Acids Res Symp Ser 225-232).

NATURALLY OCCURRING 

[0106] As used herein "naturally occurring" refers to a PFI-001 with an amino acid sequence found in nature.

ISOLATED/PURIFIED

[0107] As used herein, the terms "isolated" and "purified" refer to molecules, either nucleic or amino acid sequences,
that are removed from their natural environment and isolated or separated from at least one other component with
which they are naturally associated.

BIOLOGICALLY ACTIVE 

[0108] As used herein "biologically active" refers to a PFI-001 according to the present invention - such as a
recombinant PFI-001 - having a similar structural function (but not necessarily to the same degree), and/or similar regulatory
function (but not necessarily to the same degree), and/or similar biochemical function (but not necessarily to the same
degree) and/or immunological activity (but not necessarily to the same degree) of the naturally occurring PFI-001.
Specifically, a PFI-001 of the present invention has the ability to act as a GPCR, which is one of the characteristic
activities of the PFI-001 polypeptide of the present invention.

IMMUNOLOGICAL ACTIVITY 

[0109] As used herein, "immunological activity" is defined as the capability of the natural, recombinant or synthetic
PFI-001 or any oligopeptide thereof, to induce a specific immune response in appropriate animals or cells and to bind
with specific antibodies.

DERIVATIVE 

[0110] The term "derivative" as used herein in relation to the amino acid sequence includes chemical modification
of a PFI-001. Illustrative of such modifications would be replacement of hydrogen by an alkyl, acyl, or amino group.

ANALOGUE 

[0111] The term "analogue" as used herein in relation to the amino acid sequence (or the coding sequence thereof)
includes chemical modification of a PFI-001 or the coding sequence thereof. Illustrative of such modifications would
be replacement of natural amino acid residues or natural nucleotides with non-natural amino acid residues (e.g.
D-amino acids, beta-alanine, hydroxyproline) or non-natural nucleotides (e.g. inosine, demethyl-cytidine).

DELETION

[0112] As used herein a "deletion" is defined as a change in either nucleotide or amino acid sequence in which one
or more nucleotides or amino acid residues, respectively, are absent.

INSERTION/ADDITION

[0113] As used herein an "insertion" or "addition" is a change in a nucleotide or amino acid sequence which has
resulted in the addition of one or more nucleotides or amino acid residues, respectively, as compared to the naturally
occurring PFI-001.

SUBSTITUTION

[0114] As used herein "substitution" results from the replacement of one or more nucleotides or amino acids by
different nucleotides or amino acids, respectively.

HOMOLOGUE

[0115] The term "homologue" with respect to the nucleotide sequence of the present invention and the amino acid
sequence of the present invention may be synonymous with allelic variations of the sequences.

[0116] In particular, the term "homology" as used herein may be equated with the term "identity". Here, sequence
homology with respect to the nucleotide sequence of the present invention and the amino acid sequence of the present
invention can be determined by a simple "eyeball" comparison (i.e. a strict comparison) of any one or more of the
sequences with another sequence to see if that other sequence has at least 70% identity to the nucleotide and amino
acid sequences. Relative sequence homology (i.e. sequence identity) can also be determined by commercially
available computer programs that can calculate percentage (%) homology between two or more sequences. Typical
examples of such computer programs are FASTA or BLAST.

[0117] Percentage (%) homology may be calculated over contiguous sequences, i.e. one sequence is aligned with
the other sequence and each amino acid in one sequence directly compared with the corresponding amino acid in the
other sequence, one residue at a time. This is called an "ungapped" alignment. Typically, such ungapped alignments
are performed only over a relatively short number of residues (for example less than 50 contiguous amino acids).

[0118] Although this is a very simple and consistent method, it fails to take into consideration that, for example, in
an otherwise identical pair of sequences, one insertion or deletion will cause the following amino acid residues to be
put out of alignment, thus potentially resulting in a large reduction in % homology when a global alignment is performed.
Consequently, most sequence comparison methods are designed to produce optimal alignments that take into
consideration possible insertions and deletions without penalising unduly the overall homology score. This is achieved by
inserting "gaps" in the sequence alignment to try to maximise local homology.
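The effect of a single insertion on an ungapped comparison can be shown with a short sketch. The function name and the two example sequences are hypothetical; identity is counted position by position over the shorter of the two sequences, which is one simple reading of the ungapped comparison described above.

```python
def ungapped_percent_identity(seq_a, seq_b):
    """Percentage of identical residues, compared position by position.

    No gaps are introduced; only the overlapping length is compared.
    """
    overlap = min(len(seq_a), len(seq_b))
    if overlap == 0:
        raise ValueError("both sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / overlap


if __name__ == "__main__":
    reference = "ACDEFGHIKL"
    with_insertion = "AXCDEFGHIKL"  # one residue inserted after position 1
    print(ungapped_percent_identity(reference, reference))
    print(ungapped_percent_identity(reference, with_insertion))
```

The second comparison drops to a small fraction of the first, even though the two sequences differ by only one inserted residue, because every residue after the insertion is shifted out of register.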

[0119] However, these more complex methods assign "gap penalties" to each gap that occurs in the alignment so
that, for the same number of identical amino acids, a sequence alignment with as few gaps as possible - reflecting
higher relatedness between the two compared sequences - will achieve a higher score than one with many gaps.
"Affine gap costs" are typically used that charge a relatively high cost for the existence of a gap and a smaller penalty
for each subsequent residue in the gap. This is the most commonly used gap scoring system. High gap penalties will
of course produce optimised alignments with fewer gaps. Most alignment programs allow the gap penalties to be
modified. However, it is preferred to use the default values when using such software for sequence comparisons. For
example when using the GCG Wisconsin Bestfit package (see below) the default gap penalty for amino acid sequences
is -12 for a gap and -4 for each extension.
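An affine gap cost of this kind can be sketched as follows, using the amino acid defaults quoted above (-12 for a gap, -4 for each extension). The sketch assumes that a gap of n residues costs the gap charge plus n extension charges; programs differ on whether the first gapped residue also incurs an extension charge, so this convention and the function names are assumptions.

```python
import re


def gap_penalty(length, gap_open=-12, gap_extend=-4):
    """Affine cost of one gap: a fixed charge plus a charge per gapped residue."""
    return gap_open + gap_extend * length


def total_gap_penalty(aligned_a, aligned_b, gap_open=-12, gap_extend=-4):
    """Sum the affine penalties of every run of '-' in two aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    runs = re.findall(r"-+", aligned_a) + re.findall(r"-+", aligned_b)
    return sum(gap_penalty(len(run), gap_open, gap_extend) for run in runs)


if __name__ == "__main__":
    # One gap of two residues versus two separate gaps of one residue each.
    print(total_gap_penalty("AC--GT", "ACTTGT"))
    print(total_gap_penalty("A-C-GT", "ATCTGT"))
```

Both alignments leave two residues unpaired, but the alignment with a single longer gap is penalised less than the one with two separate gaps, which is the behaviour the affine scheme is intended to produce.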

[0120] Calculation of maximum % homology therefore firstly requires the production of an optimal alignment, taking
into consideration gap penalties. A suitable computer program for carrying out such an alignment is the GCG Wisconsin
Bestfit package (University of Wisconsin, U.S.A.; Devereux et al., 1984, Nucleic Acids Research 12:387). Examples
of other software that can perform sequence comparisons include, but are not limited to, the BLAST package (see
Ausubel et al., 1999 ibid - Chapter 18), FASTA (Altschul et al., 1990, J. Mol. Biol., 403-410) and the GENEWORKS
suite of comparison tools. Both BLAST and FASTA are available for off-line and on-line searching (see Ausubel et al.,
1999 ibid, pages 7-58 to 7-60). However, for some applications it is preferred to use the GCG Bestfit program.

[0121] Although the final % homology can be measured in terms of identity, in some cases, the alignment process
itself is typically not based on an all-or-nothing pair comparison. Instead, a scaled similarity score matrix is generally
used that assigns scores to each pairwise comparison based on chemical similarity or evolutionary distance. An
example of such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the BLAST suite of programs.
GCG Wisconsin programs generally use either the public default values or a custom symbol comparison table if supplied
(see user manual for further details). It is preferred to use the public default values for the GCG package, or in the
case of other software, the default matrix, such as BLOSUM62.

[0122] Once the software has produced an optimal alignment, it is possible to calculate % homology, preferably % 
sequence identity. The software typically does this as part of the sequence comparison and generates a numerical 
result. 

[0123] As indicated, for some applications, sequence homology (or identity) may be determined using any suitable
homology algorithm, using for example default parameters. For a discussion of basic issues in similarity searching of
sequence databases, see Altschul et al. (1994) Nature Genetics 6:119-129. For some applications, the BLAST
algorithm is employed, with parameters set to default values. The BLAST algorithm is described in detail at
http://www.ncbi.nih.gov/BLAST/blast_help.html. Advantageously, "substantial homology", when assessed by BLAST, equates to
sequences which match with an EXPECT value of at least about e-7, preferably at least about e-9 and most preferably
e-10 or lower. The default threshold for EXPECT in BLAST searching is usually 10.

[0124] Should Gap Penalties be used when determining sequence identity, then preferably the following parameters
are used:

GAP PENALTY   10   10
0.1

[0125] Other computer program methods to determine identity and similarity between the two sequences include
but are not limited to the GCG program package (Devereux et al., 1984, Nucleic Acids Research 12: 387) and FASTA
(Altschul et al., 1990, J Molec Biol 403-410).



PROFILE HIDDEN MARKOV MODELS

[0126] The profile of a family of sequences can be modelled in a probabilistic way using hidden Markov models
(HMMs) (Durbin, R., Eddy, S., Krogh, A. and Mitchison, G. (1998), Biological Sequence Analysis: Probabilistic models
of proteins and nucleic acids; Cambridge University Press, Cambridge, UK; Eddy, S.R. (1998), Profile hidden Markov
models; Bioinformatics, 14: 755-763). HMMs were originally developed for use in speech recognition and have since
been used for a range of signal processing applications. A signal is modelled as a sequence of elements from a finite
set. In speech recognition this set consists of the sounds that make up the language and the aim is to deduce what
words the sounds represent. A protein sequence from a particular family can be thought of in the same way. The
sequence is a series of elements from the set of 20 amino acids and the requirement is to determine what protein
family the sequence could represent.

[0127] Profile HMMs can be constructed from a multiple sequence alignment of known family members. Each position
in the alignment corresponds to a state, where states are "match", "insert" or "delete". The whole alignment can
therefore be thought of as a linear chain, or path, of interconnecting states connected by transition probabilities. These
states "emit" amino acid residues with different probabilities (based on the range of residues seen in the alignment
together with substitution data and "pseudocount" methods - to correct for limited data in the initial alignment). The
probability of a particular sequence matching the model can be calculated as a combination of the transition probabilities
between states and the emission probabilities for that amino acid sequence.
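The combination of transition and emission probabilities along one path through the model can be sketched as below. The two-position model, its state names ("M1", "M2", "D2") and all probability values are invented for illustration and are not taken from the GPCR models described in this specification. A full profile HMM search would also sum or maximise over all possible paths, which this sketch does not attempt.

```python
import math


def path_log_prob(sequence, path, transitions, emissions):
    """Natural-log probability of a sequence along one given state path.

    States whose names start with "D" are treated as delete states, which
    consume no residue; all other states emit exactly one residue.
    """
    log_p = 0.0
    previous = "begin"
    position = 0
    for state in path:
        log_p += math.log(transitions[(previous, state)])
        if not state.startswith("D"):
            log_p += math.log(emissions[state][sequence[position]])
            position += 1
        previous = state
    log_p += math.log(transitions[(previous, "end")])
    if position != len(sequence):
        raise ValueError("path does not emit the whole sequence")
    return log_p


# Toy model with two alignment positions (all values are assumptions).
TRANSITIONS = {
    ("begin", "M1"): 1.0,
    ("M1", "M2"): 0.9,
    ("M1", "D2"): 0.1,
    ("M2", "end"): 1.0,
    ("D2", "end"): 1.0,
}
EMISSIONS = {
    "M1": {"A": 0.8, "G": 0.2},
    "M2": {"C": 0.5, "S": 0.5},
}

if __name__ == "__main__":
    full = path_log_prob("AC", ["M1", "M2"], TRANSITIONS, EMISSIONS)
    deleted = path_log_prob("A", ["M1", "D2"], TRANSITIONS, EMISSIONS)
    print(math.exp(full), math.exp(deleted))
```

Working in log space avoids numerical underflow when the chain of states is as long as a real protein family alignment.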

[0128] Once a good alignment of known family members has been obtained, HMMs can be constructed with little
manual intervention (compared with profiles) and are a more sensitive method for detecting remote homologues. The
Pfam database is a library of profile HMMs for many protein families (Bateman, A., Birney, E., Durbin, R., Eddy, S.R.,
Finn, R.D. and Sonnhammer, E.L.L. (1999), Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority
of proteins. Nucleic Acids Research, 27: 260-262). It contains models constructed from carefully edited seed alignments
and models automatically generated by an iterative procedure. This complete automation of the process has the
disadvantage that any errors that are made will be rapidly propagated.

[0129] For the present analysis, profile HMMs were constructed for every major GPCR sub-family by assembling all
known members and aligning them with CLUSTAL. Each sub-family model was incorporated into a database and the
novel GPCRs searched with the entire database to identify the best match.

POLYPEPTIDE VARIANTS AND DERIVATIVES 

[0130] The terms "variant" or "derivative" in relation to the amino acid sequences of the present invention include
any substitution of, variation of, modification of, replacement of, deletion of or addition of one (or more) amino acids
from or to the sequence providing the resultant amino acid sequence has PFI-001 activity, preferably having at least
the same activity as the polypeptide presented in SEQ ID NO: 2.

[0131] The sequences of the present invention may be modified for use in the present invention. Typically,
modifications are made that maintain the PFI-001 activity of the sequence. Amino acid substitutions may be made, for example
from 1, 2 or 3 to 10, 20 or 30 substitutions provided that the modified sequence retains PFI-001 activity. Amino acid
substitutions may include the use of non-naturally occurring analogues, for example to increase blood plasma half-life
of a therapeutically administered polypeptide.

[0132] Conservative substitutions may be made, for example according to the Table below. Amino acids in the same
block in the second column and preferably in the same line in the third column may be substituted for each other:




[0133] As indicated above, proteins of the invention are typically made by recombinant means, for example as
described herein, and/or by using synthetic means using techniques well known to the skilled person such as solid phase
synthesis. Variants and derivatives of such sequences include fusion proteins wherein the fusion proteins comprise
at least the amino acid sequence of the present invention being linked (directly or indirectly) to another amino acid
sequence. These other amino acid sequences - which are sometimes referred to as fusion protein partners - will typically
impart a favourable functionality - such as to aid extraction and purification of the amino acid sequence of the present
invention. Examples of fusion protein partners include glutathione-S-transferase (GST), 6xHis, GAL4 (DNA binding
and/or transcriptional activation domains) and β-galactosidase. It may also be convenient to include a proteolytic
cleavage site between the fusion protein partner and the protein sequence of the present invention so as to allow removal
of the latter. Preferably the fusion protein partner will not hinder the function of the protein of the present invention.

POLYNUCLEOTIDE VARIANTS AND DERIVATIVES 

[0134] The terms "variant" or "derivative" in relation to the nucleotide sequence of the present invention include any
substitution of, variation of, modification of, replacement of, deletion of or addition of one (or more) nucleic acid from
or to the sequence providing the resultant nucleotide sequence codes for a polypeptide having PFI-001 activity,
preferably having at least the same activity as the sequence presented in SEQ ID NO: 2.

[0135] As indicated above, with respect to sequence homology, there is at least 70%, preferably at least 75%, more
preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 95%
homology to the nucleotide sequence shown in SEQ ID NO: 1. Most preferably there is at least 98% homology to the
nucleotide sequence shown in SEQ ID NO: 1. Nucleotide homology comparisons may be conducted as described
above. For some applications, a preferred sequence comparison program is the GCG Wisconsin Bestfit program
described above. The default scoring matrix has a match value of 10 for each identical nucleotide and -9 for each
mismatch. The default gap creation penalty is -50 and the default gap extension penalty is -3 for each nucleotide.
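The nucleotide scoring scheme quoted above can be applied to an alignment that has already been produced, as in the following sketch. It only scores a given gapped alignment and does not search for the optimal one as Bestfit does. The function name is hypothetical, and the convention that a gap of n nucleotides costs the creation penalty plus n extension penalties is an assumption.

```python
def score_alignment(aligned_a, aligned_b, match=10, mismatch=-9,
                    gap_open=-50, gap_extend=-3):
    """Score two pre-aligned nucleotide sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    open_gap_side = None  # which sequence currently has an open gap, if any
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" and y == "-":
            raise ValueError("a column cannot be a gap in both sequences")
        if x == "-" or y == "-":
            side = "a" if x == "-" else "b"
            if side != open_gap_side:
                score += gap_open      # charged once per gap
            score += gap_extend        # charged for every gapped nucleotide
            open_gap_side = side
        else:
            score += match if x == y else mismatch
            open_gap_side = None
    return score


if __name__ == "__main__":
    print(score_alignment("ACGT", "ACGT"))
    print(score_alignment("ACGT", "ACCT"))
    print(score_alignment("ACGT", "AC-T"))
```

With these values a single gap costs far more than a single mismatch, so gaps are introduced only where they recover a substantial run of matches.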

[0136] As used herein, the terms "variant", "homologue", "fragment" and "derivative" embrace allelic variations of
the sequences.

[0137] The term "variant" also encompasses sequences that are complementary to sequences that are capable of
hybridising to the nucleotide sequences presented herein.

HYBRIDISATION

[0138] The term "hybridisation" as used herein shall include "the process by which a strand of nucleic acid joins with
a complementary strand through base pairing" (Coombs J (1994) Dictionary of Biotechnology, Stockton Press, New
York, NY, USA) as well as the process of amplification as carried out in PCR technologies as described in Dieffenbach
CW and GS Dveksler (1995, PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview, NY, USA).

[0139] Hybridisation conditions are based on the melting temperature (Tm) of the nucleic acid binding complex, as
taught in Berger and Kimmel (1987, Guide to Molecular Cloning Techniques, Methods in Enzymology, Vol 152,
Academic Press, San Diego, CA, USA), and confer a defined "stringency" as explained below.

[0140] Stringency of hybridisation refers to conditions under which polynucleic acid hybrids are stable. Such
conditions are evident to those of ordinary skill in the field. As known to those of skill in the art, the stability of hybrids is
reflected in the melting temperature (Tm) of the hybrid which decreases approximately 1 to 1.5°C with every 1%
decrease in sequence homology. In general, the stability of a hybrid is a function of sodium ion concentration and
temperature. Typically, the hybridisation reaction is performed under conditions of higher stringency, followed by washes
of varying stringency.

[0141] As used herein, high stringency refers to conditions that permit hybridisation of only those nucleic acid
sequences that form stable hybrids in 1 M Na+ at 65-68°C.

[0142] Maximum stringency typically occurs at about Tm-5°C (5°C below the Tm of the probe).

[0143] High stringency occurs at about 5°C to 10°C below the Tm of the probe. High stringency conditions can be
provided, for example, by hybridisation in an aqueous solution containing 6x SSC, 5x Denhardt's, 1% SDS (sodium
dodecyl sulphate), 0.1 Na+ pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as non-specific competitor.
Following hybridisation, high stringency washing may be done in several steps, with a final wash (about 30 min) at the
hybridisation temperature in 0.2-0.1x SSC, 0.1% SDS.

[0144] Moderate, or intermediate, stringency typically occurs at about 10°C to 20°C below the Tm of the probe.
[0145] Low stringency typically occurs at about 20°C to 25°C below the Tm of the probe. 
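
By way of illustration only, the temperature ranges given above relative to the Tm of the probe, and the stated decrease of approximately 1 to 1.5°C in Tm per 1% decrease in sequence homology, may be sketched in Python as follows. The function names, the labels returned, and the assignment of the shared boundary values (5, 10 and 20°C below Tm) to the more stringent class are assumptions made for this sketch.

```python
from typing import Tuple


def classify_stringency(tm_c: float, hyb_temp_c: float) -> str:
    """Label a hybridisation temperature by how far it lies below the probe Tm."""
    delta = tm_c - hyb_temp_c  # degrees C below Tm
    if delta < 0:
        return "above Tm"          # outside the ranges described in the text
    if delta <= 5:
        return "maximum"           # about Tm - 5 degrees C
    if delta <= 10:
        return "high"              # about 5 to 10 degrees C below Tm
    if delta <= 20:
        return "moderate"          # about 10 to 20 degrees C below Tm
    if delta <= 25:
        return "low"               # about 20 to 25 degrees C below Tm
    return "below low"             # outside the ranges described in the text


def tm_range_for_homology(tm_perfect_c: float, percent_homology: float) -> Tuple[float, float]:
    """Return (lowest, highest) estimated Tm, losing 1 to 1.5 degrees C per 1% mismatch."""
    mismatch = 100.0 - percent_homology
    return (tm_perfect_c - 1.5 * mismatch, tm_perfect_c - 1.0 * mismatch)


if __name__ == "__main__":
    print(classify_stringency(70.0, 62.0))
    print(tm_range_for_homology(70.0, 90.0))
```

The estimate is a rule of thumb only; optimal conditions still have to be determined empirically for a given probe.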

[0146] As will be understood by those of skill in the art, a maximum stringency hybridisation can be used to identify 
or detect identical polynucleotide sequences while an intermediate (or low) stringency hybridisation can be used to 
identify or detect similar or related polynucleotide sequences. 

[0147] Moderate stringency refers to conditions equivalent to hybridisation in the above-described solution but at
about 60-62°C. In that case the final wash is performed at the hybridisation temperature in 1x SSC, 0.1% SDS. 
[0148] Low stringency refers to conditions equivalent to hybridisation in the above-described solution at about 
50-52°C. In that case, the final wash is performed at the hybridisation temperature in 2x SSC, 0.1% SDS. 
[0149] It is understood that these conditions may be adapted and duplicated using a variety of buffers, e.g.
formamide-based buffers, and temperatures. Denhardt's solution and SSC are well known to those of skill in the art as are other
suitable hybridisation buffers (see, e.g., Sambrook, et al., eds. (1989) Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory Press, New York, NY, USA or Ausubel, et al., eds. (1990) Current Protocols in Molecular
Biology, John Wiley & Sons, Inc.). Optimal hybridisation conditions have to be determined empirically, as the length
and the GC content of the probe also play a role.

[0150] Polynucleotides of the invention capable of selectively hybridising to the nucleotide sequences presented
herein, or to their complement, will be generally at least 70%, preferably at least 75%, more preferably at least 80%,
more preferably at least 85%, more preferably at least 90%, more preferably at least 95%, most preferably at least
98% homologous to the corresponding nucleotide sequences presented herein over a region of at least 20, preferably
at least 25 or 30, for instance at least 40, 60 or 100 or more contiguous nucleotides.
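
As a minimal sketch of how a percentage threshold over a region of contiguous nucleotides might be checked, the following Python fragment compares two sequences position by position. It assumes the two sequences have already been aligned without gaps and are of equal length, which is a simplification; the function names are hypothetical and a dedicated alignment program would be used in practice.

```python
def percent_identity(a: str, b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def best_window_identity(probe: str, target: str, window: int = 20) -> float:
    """Highest percent identity found over any run of `window` contiguous positions."""
    if len(probe) != len(target):
        raise ValueError("sequences must be of equal length (ungapped comparison)")
    if window > len(probe):
        raise ValueError("window is longer than the sequences")
    return max(
        percent_identity(probe[i:i + window], target[i:i + window])
        for i in range(len(probe) - window + 1)
    )


if __name__ == "__main__":
    probe = "ACGT" * 6
    target = "TT" + probe[2:]  # two mismatches at the 5' end
    print(percent_identity(probe, target))
    print(best_window_identity(probe, target, window=20) >= 70.0)
```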

[0151] The term "selectively hybridisable" means that the polynucleotide used as a probe is used under conditions
where a target polynucleotide of the invention is found to hybridize to the probe at a level significantly above background.
The background hybridization may occur because of other polynucleotides present, for example, in the cDNA or
genomic DNA library being screened. In this event, background implies a level of signal generated by interaction between
the probe and a non-specific DNA member of the library which is less than 10-fold, preferably less than 100-fold as
intense as the specific interaction observed with the target DNA. The intensity of interaction may be measured, for
example, by radiolabelling the probe, e.g. with 32P.

[0152] In a preferred aspect, the present invention covers nucleotide sequences that can hybridise to any one or
more of the nucleotide sequences of the present invention under stringent conditions (e.g. 65°C and 0.1x SSC {1x SSC
= 0.15 M NaCl, 0.015 M Na3 citrate pH 7.0}).
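
For convenience, the salt concentrations corresponding to the SSC dilutions mentioned herein (for example 0.1x, 0.2x, 1x, 2x and 6x) can be derived from the 1x composition given above. The short Python sketch below does this by simple scaling. The function name and dictionary keys are hypothetical, and the total sodium figure assumes that trisodium citrate contributes three sodium ions per formula unit and is fully dissociated.

```python
NACL_M_AT_1X = 0.15      # mol/l NaCl in 1x SSC, as stated in the text
CITRATE_M_AT_1X = 0.015  # mol/l Na3 citrate in 1x SSC, as stated in the text


def ssc_molarity(fold: float) -> dict:
    """Scale the 1x SSC composition to an arbitrary dilution such as 0.1 or 6."""
    nacl = NACL_M_AT_1X * fold
    citrate = CITRATE_M_AT_1X * fold
    return {
        "NaCl_M": nacl,
        "Na3_citrate_M": citrate,
        # Assumption: three Na+ per citrate, full dissociation.
        "total_Na_M": nacl + 3 * citrate,
    }


if __name__ == "__main__":
    for fold in (0.1, 0.2, 1, 2, 6):
        print(fold, ssc_molarity(fold))
```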
[0153] Where the polynucleotide of the present invention is double-stranded, both strands of the duplex, either
individually or in combination, are encompassed by the present invention. Where the polynucleotide is single-stranded, it
is to be understood that the complementary sequence of that polynucleotide is also included within the scope of the
present invention.
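
The complementary sequence of a single-stranded polynucleotide can be derived mechanically. A minimal Python sketch, assuming a DNA sequence written 5' to 3' using only the letters A, C, G and T (the function name is hypothetical):

```python
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, also written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
```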

[0154] Polynucleotides which are not 100% homologous to the sequences of the present invention but fall within the
scope of the invention can be obtained in a number of ways. Other variants of the sequences described herein may
be obtained, for example, by probing DNA libraries made from a range of individuals, for example individuals from
different populations. In addition, other viral/bacterial, or cellular homologues, particularly cellular homologues found
in mammalian cells (e.g. bovine, ovine, porcine, equine, rodent and primate cells), may be obtained and such
homologues and fragments thereof in general will be capable of selectively hybridising to the sequence shown in SEQ ID
NO: 1. Such sequences may be obtained by probing cDNA libraries made from, or genomic DNA libraries derived from,
other animal species, and probing such libraries with probes comprising all or part of the sequence shown in SEQ ID
NO: 1 under conditions of medium to high stringency. Similar considerations apply to obtaining species homologues
and allelic variants of the polypeptide or nucleotide sequences of the invention.

[0155] Variants and strain/species homologues may also be obtained using degenerate PCR which will use primers
designed to target sequences within the variants and homologues encoding conserved amino acid sequences within
the sequences of the present invention. Conserved sequences can be predicted, for example, by aligning the amino
acid sequences from several variants/homologues. Sequence alignments can be performed using computer software
known in the art. For example the GCG Wisconsin PileUp program is widely used.

[0156] The primers used in degenerate PCR will contain one or more degenerate positions and will be used at 
stringency conditions lower than those used for cloning sequences with single sequence primers against known se- 
quences. 
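
To illustrate what is meant by degenerate positions, the following Python sketch counts and enumerates the individual primer sequences represented by a degenerate primer written with the standard IUPAC nucleotide ambiguity codes. The function names and the example primer are hypothetical and do not correspond to any sequence of the invention.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer: str) -> int:
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer: str) -> list:
    """List every non-degenerate sequence encoded by the primer."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in primer.upper()))]


if __name__ == "__main__":
    example = "GAYTGGN"  # hypothetical primer with two degenerate positions
    print(degeneracy(example))
    print(expand("ATGAAR"))
```

A higher degeneracy means a more complex primer pool, which is one reason lower stringency conditions are used with such primers.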

[0157] Alternatively, such polynucleotides may be obtained by site-directed mutagenesis of characterised sequences.
This may be useful where, for example, silent codon changes are required to sequences to optimise codon
preferences for a particular host cell in which the polynucleotide sequences are being expressed. Other sequence changes
may be desired in order to introduce restriction enzyme recognition sites, or to alter the property or function of the
polypeptides encoded by the polynucleotides.

[0158] Polynucleotides of the invention may be used to produce a primer, e.g. a PCR primer, a primer for an alternative
amplification reaction, a probe e.g. labelled with a revealing label by conventional means using radioactive or
non-radioactive labels, or the polynucleotides may be cloned into vectors. Such primers, probes and other fragments will
be at least 15, preferably at least 20, for example at least 25, 30 or 40 nucleotides in length, and are also encompassed
by the term polynucleotides of the invention as used herein.

[0159] Polynucleotides such as DNA polynucleotides and probes according to the invention may be produced
recombinantly, synthetically, or by any means available to those of skill in the art. They may also be cloned by standard
techniques.

[0160] In general, primers will be produced by synthetic means, involving a stepwise manufacture of the desired
nucleic acid sequence one nucleotide at a time. Techniques for accomplishing this using automated techniques are
readily available in the art.

[0161] Longer polynucleotides will generally be produced using recombinant means, for example using PCR cloning
techniques. This will involve making a pair of primers (e.g. of about 15 to 30 nucleotides) flanking a region of the
sequence which it is desired to clone, bringing the primers into contact with mRNA or cDNA obtained from an animal 
or human cell, performing a polymerase chain reaction under conditions which bring about amplification of the desired 
region, isolating the amplified fragment (e.g. by purifying the reaction mixture on an agarose gel) and recovering the 
amplified DNA. The primers may be designed to contain suitable restriction enzyme recognition sites so that the am- 
plified DNA can be cloned into a suitable cloning vector. 
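
A simplified Python sketch of the primer design step described above is given below. It takes the first and last bases of the region to be cloned as the annealing portions of a forward and a reverse primer and adds a restriction enzyme recognition site to the 5' end of each. The choice of the EcoRI (GAATTC) and BamHI (GGATCC) recognition sites, the default primer length, the function names and the example template are all assumptions for illustration; practical primer design would also consider melting temperature, secondary structure and additional 5' bases to aid digestion.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(template: str, start: int, end: int, length: int = 20,
                   fwd_site: str = "GAATTC", rev_site: str = "GGATCC"):
    """Return (forward, reverse) primers for template[start:end] with 5' site tails."""
    region = template[start:end].upper()
    if len(region) < 2 * length:
        raise ValueError("region is too short for two non-overlapping primers")
    forward = fwd_site + region[:length]
    # The reverse primer anneals to the top strand, so it is the reverse
    # complement of the last `length` bases of the region.
    reverse = rev_site + reverse_complement(region[-length:])
    return forward, reverse


if __name__ == "__main__":
    template = "AAAA" + "ATGC" * 10 + "TTTT"  # hypothetical template
    fwd, rev = design_primers(template, 4, 44, length=15)
    print(fwd)
    print(rev)
```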

REGULATORY SEQUENCES 

[0162] Preferably, the polynucleotide of the present invention is operably linked to a regulatory sequence which is
capable of providing for the expression of the coding sequence, such as by the chosen host cell. By way of example, 
the present invention covers a vector comprising the polynucleotide of the present invention operably linked to such 
a regulatory sequence, i.e. the vector is an expression vector. 

[0163] The term "operably linked" refers to a juxtaposition wherein the components described are in a relationship 
permitting them to function in their intended manner. A regulatory sequence "operably linked" to a coding sequence is 
ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control
sequences.

[0164] The term "regulatory sequences" includes promoters and enhancers and other expression regulation signals. 

[0165] The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase binding site.

[0166] Enhanced expression of the polynucleotide encoding the polypeptide of the present invention may also be
achieved by the selection of heterologous regulatory regions, e.g. promoter, secretion leader and terminator regions,
which serve to increase expression and, if desired, secretion levels of the protein of interest from the chosen expression
host and/or to provide for the inducible control of the expression of the polypeptide of the present invention.

[0167] Preferably, the nucleotide sequence of the present invention may be operably linked to at least a promoter.

[0168] Aside from the promoter native to the gene encoding the polypeptide of the present invention, other promoters
may be used to direct expression of the polypeptide of the present invention. The promoter may be selected for its
efficiency in directing the expression of the polypeptide of the present invention in the desired expression host.

[0169] In another embodiment, a constitutive promoter may be selected to direct the expression of the desired
polypeptide of the present invention. Such an expression construct may provide additional advantages since it
circumvents the need to culture the expression hosts on a medium containing an inducing substrate.

[0170] Examples of suitable promoters would be LTR, SV40 and CMV in mammalian systems; E. coli lac or trp in
bacterial systems; baculovirus polyhedrin promoter (polh) in insect systems and other promoters that are known to
control expression in eukaryotic and prokaryotic cells or their viruses.

[0171] Examples of strong constitutive and/or inducible promoters which are preferred for use in fungal expression
hosts are those which are obtainable from the fungal genes for xylanase (xlnA), phytase, ATP-synthetase, subunit 9
(oliC), triose phosphate isomerase (tpi), alcohol dehydrogenase (AdhA), α-amylase (amy), amyloglucosidase (AG -
from the glaA gene), acetamidase (amdS) and glyceraldehyde-3-phosphate dehydrogenase (gpd) promoters.

[0172] Examples of strong yeast promoters are those obtainable from the genes for alcohol dehydrogenase, lactase,
3-phosphoglycerate kinase and triosephosphate isomerase.

[0173] Examples of strong bacterial promoters are the α-amylase and SPO2 promoters as well as promoters from
extracellular protease genes.

[0174] Hybrid promoters may also be used to improve inducible regulation of the expression construct. 

[0175] The promoter can additionally include features to ensure or to increase expression in a suitable host. For
example, the features can be conserved regions such as a Pribnow Box or a TATA box. The promoter may even contain
other sequences to affect (such as to maintain, enhance or decrease) the levels of expression of the nucleotide
sequence of the present invention. For example, suitable other sequences include the Sh1-intron or an ADH intron. Other
sequences include inducible elements - such as temperature, chemical, light or stress inducible elements. Also, suitable
elements to enhance transcription or translation may be present. An example of the latter element is the TMV 5' signal
sequence (see Sleat, Gene 217 [1987] 217-225; and Dawson, Plant Mol. Biol. 23 [1993] 97).

[0176] The expression vector may also contain sequences which act on the promoter to amplify expression. For
example, the SV40, CMV, and polyoma cis-acting elements (enhancer) and a selectable marker can provide a
phenotypic trait for selection (e.g. dihydrofolate reductase or neomycin resistance for mammalian cells or
ampicillin/tetracycline resistance for E. coli). Selection of the appropriate vector containing the appropriate promoter and selection
marker is well within the level of those skilled in the art.

CONSTRUCTS 

[0177] The term "construct" - which is synonymous with terms such as "conjugate", "cassette" and "hybrid" - includes
the nucleotide sequence according to the present invention directly or indirectly attached to a promoter. An example
of an indirect attachment is the provision of a suitable spacer group such as an intron sequence, such as the Sh1-intron
or the ADH intron, intermediate the promoter and the nucleotide sequence of the present invention. The same is true
for the term "fused" in relation to the present invention which includes direct or indirect attachment. In each case, the
terms do not cover the natural combination of the nucleotide sequence coding for the protein ordinarily associated with
the wild-type gene promoter and when they are both in their natural environment.

[0178] The construct may even contain or express a marker which allows for the selection of the genetic construct
in, for example, mammalian, yeast, insect, fungal or bacterial cells. Various markers exist which may be used, such
as, for example, those that provide for antibiotic resistance to G418, hygromycin, bleomycin, kanamycin and
gentamycin.

[0179] Preferably the construct of the present invention comprises at least the nucleotide sequence of the present 
invention operably linked to a promoter. 

VECTORS 

[0180] The term "vector" includes expression vectors and transformation vectors and shuttle vectors. 

[0181] The term "expression vector" means a construct capable of in vivo or in vitro expression. 

[0182] The term "transformation vector" means a construct capable of being transferred from one entity to another
entity - which may be of the same species or may be of a different species. If the construct is capable of being transferred
from one species to another - such as from a viral vector such as MMLV or FIV to a human or mammalian primary cell
or cell line, then the transformation vector is sometimes referred to as a "shuttle vector".

[0183] A large variety of expression systems may be used in different hosts, for example episomal, chromosomal
and virus-derived systems (e.g. vectors derived from bacterial plasmids, bacteriophage, papova virus such as SV40,
vaccinia virus, adenovirus, and retrovirus).

[0184] The DNA sequence can be inserted into the vector by a variety of techniques. In general the DNA sequence
is inserted into an appropriate restriction endonuclease site by procedures known in the art and deemed to be within
the scope of those skilled in the art. The DNA sequence in the expression vector is linked operatively to appropriate
control sequences that direct mRNA synthesis (i.e. the promoter).

[0185] The vectors of the present invention may be transformed into a suitable host cell as described below to provide
for expression of a polypeptide of the present invention. Thus, in a further aspect, the invention provides a process for
preparing polypeptides according to the present invention which comprises cultivating a host cell transformed or
transfected with an expression vector as described above under conditions to provide for expression by the vector of a
coding sequence encoding the polypeptides, and recovering the expressed polypeptides.

[0186] The vectors may be, for example, plasmid, virus or bacteriophage (phage) vectors provided with an origin of
replication, optionally a promoter for the expression of the polynucleotide and optionally a regulator of the promoter.
[0187] The vectors of the present invention may contain one or more selectable marker genes. The most suitable
selection systems for industrial micro-organisms are those formed by the group of selection markers which do not
require a mutation in the host organism. Examples of fungal selection markers are the genes for acetamidase (amdS),
ATP synthetase, subunit 9 (oliC), orotidine-5'-phosphate-decarboxylase (pyrA), phleomycin and benomyl resistance
(benA). Examples of non-fungal selection markers are the bacterial G418 resistance gene (this may also be used in
mammalian cells, yeast, but not in filamentous fungi), the ampicillin resistance gene (E. coli), the neomycin resistance
gene (mammalian cells) and the E. coli uidA gene, coding for β-glucuronidase (GUS).

[0188] Vectors may be used in vitro, for example for the production of RNA or used to transfect or transform a host cell.

[0189] Thus, polynucleotides of the present invention can be incorporated into a recombinant vector (typically a
replicable vector), for example a cloning or expression vector. The vector may be used to replicate the nucleic acid in
a compatible host cell. Thus, in a further embodiment, the invention provides a method of making polynucleotides of
the present invention by introducing a polynucleotide of the present invention into a replicable vector, introducing the
vector into a compatible host cell, and growing the host cell under conditions which bring about replication of the vector.
The vector may be recovered from the host cell. Suitable host cells are described below in connection with expression
vectors.
[0190] The present invention also relates to the use of genetically engineered host cells expressing a PFI-001
polypeptide or variant, homologue, fragment, analogue or derivative thereof in screening methods for the identification
of modulators (e.g. antagonists or agonists) of PFI-001. Such genetically engineered host cells could be used to screen
peptide libraries or organic molecules capable of modulating PFI-001 activity. Modulators (e.g. antagonists) of the
PFI-001 polypeptide, such as antibodies, peptides or small organic molecules will provide the basis for pharmaceutical
compositions for the treatment of diseases associated with, for example, PFI-001. Such modulators (e.g. antagonists)
can be administered alone or in combination with other therapeutics for the treatment of such diseases.

[0191] The present invention also relates to expression vectors and host cells comprising polynucleotide sequences
encoding PFI-001 or a variant, homologue, fragment, analogue or derivative thereof for the in vivo or in vitro production
of PFI-001 protein or to screen for agents that can affect PFI-001 expression or activity.

TISSUE 

[0192] The term "tissue" as used herein includes tissue per se and organ.

HOST CELLS

[0193] The term "host cell" - in relation to the present invention - includes any cell that could comprise the nucleotide
sequence coding for the recombinant protein according to the present invention and/or products obtained therefrom, 
wherein a promoter can allow expression of the nucleotide sequence according to the present invention when present 
in the host cell. 

[0194] Thus, a further embodiment of the present invention provides host cells transformed or transfected with a
polynucleotide of the present invention. Preferably said polynucleotide is carried in a vector for the replication and
expression of said polynucleotide. The cells will be chosen to be compatible with the said vector and may, for example,
be prokaryotic (for example, bacterial cells), or eukaryotic (i.e. mammalian, fungal, insect and yeast cells).

[0195] Introduction of polynucleotides into host cells can be effected by methods as described in Sambrook, et al.,
eds. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, NY, USA. These
methods include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated transfection, cationic
lipid-mediated transfection, electroporation, transvection, microinjection, transduction, scrape loading, and ballistic
introduction.

[0196] Examples of representative hosts include bacterial cells (e.g. E. coli, Streptomyces); fungal cells such as yeast
cells and Aspergillus; insect cells such as Drosophila S2 and Spodoptera SF9 cells; animal cells such as CHO, COS,
HEK, HeLa, and 3T3 cells. The selection of the appropriate host is deemed to be within the scope of those skilled in
the art.

[0197] Depending on the nature of the polynucleotide encoding the polypeptide of the present invention, and/or the
desirability for further processing of the expressed protein, eukaryotic hosts such as yeasts or other fungi may be
preferred. In general, yeast cells are preferred over fungal cells because they are easier to manipulate. However, some
proteins are either poorly expressed or secreted from the yeast cell, or in some cases are not processed properly
(e.g. hyperglycosylation in yeast). In these instances, a different fungal host organism should be selected.
[0198] Examples of suitable expression hosts within the scope of the present invention are fungi such as Aspergillus
species (such as those described in EP-A-0184438 and EP-A-0284603) and Trichoderma species; bacteria such as
Escherichia species, Streptomyces species and Pseudomonas species; and yeasts such as Kluyveromyces species
(such as those described in EP-A-0096430 and EP-A-0301670) and Saccharomyces species. By way of example,
typical expression hosts may be selected from Aspergillus niger, Aspergillus niger var. tubigenis, Aspergillus niger var.
awamori, Aspergillus aculeatus, Aspergillus nidulans, Aspergillus oryzae, Trichoderma reesei, Kluyveromyces lactis,
Schizosaccharomyces pombe, Pichia pastoris and Saccharomyces cerevisiae.

[0199] The use of suitable host cells - such as mammalian, yeast, insect and fungal host cells - may provide for
post-translational modifications (e.g. myristoylation, glycosylation, truncation, lipidation and tyrosine, serine or threonine
phosphorylation) as may be needed to confer optimal biological activity on recombinant expression products of the
present invention.

ORGANISM 

[0200] The term "organism" in relation to the present invention includes any organism, except man, that could
comprise the nucleotide sequence coding for the recombinant protein according to the present invention and/or products
obtained therefrom, wherein a promoter can allow expression of the nucleotide sequence according to the present
invention when present in the organism. Examples of organisms may include a fungus, yeast or protozoan.

[0201] The term "transgenic organism" in relation to the present invention includes any organism, except man, that
comprises the nucleotide sequence coding for the protein according to the present invention and/or products obtained
therefrom, wherein the promoter can allow expression of the nucleotide sequence according to the present invention
within the organism. Preferably the nucleotide sequence is incorporated in the genome of the organism.

[0202] The term "transgenic organism" does not cover the native nucleotide coding sequence according to the
present invention in its natural environment when it is under the control of its native promoter which is also in its natural
environment. In addition, the present invention does not cover the native protein according to the present invention
when it is in its natural environment and when it has been expressed by its native nucleotide coding sequence which
is also in its natural environment and when that nucleotide sequence is under the control of its native promoter which
is also in its natural environment.

[0203] Therefore, the transgenic organism of the present invention includes an organism comprising any one of, or 
combinations of, the nucleotide sequence coding for the amino acid sequence according to the present invention, 
constructs according to the present invention (including combinations thereof), vectors according to the present inven- 
tion, plasmids according to the present invention, cells according to the present invention, and tissues according to 
the present invention or the products thereof. The transformed cell or organism could prepare acceptable quantities 
of the desired compound which would be easily retrievable from the cell or organism. 

TRANSFORMATION OF HOST CELLS/HOST ORGANISMS 

[0204] As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. An example of a suitable
prokaryotic host is E. coli. Teachings on the transformation of prokaryotic hosts are well documented in the art, for
example see Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor
Laboratory Press, New York, NY, USA) and Ausubel et al. (Current Protocols in Molecular Biology (1995), John Wiley &
Sons, Inc.).

[0205] In a preferred embodiment, the transformed host is a mammalian cell or, for example, an insect cell, wherein
introduction of polynucleotides into said host cells can be effected by methods as described in, for example, Sambrook
et al. (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory Press, New York,
NY, USA). These methods include, but are not limited to, calcium phosphate transfection, DEAE-dextran-mediated
transfection, cationic lipid-mediated transfection, electroporation, transvection, microinjection, transduction, scrape
loading, and ballistic introduction.

[0206] In another embodiment the transgenic organism can be a yeast. In this regard, yeast have also been widely
used as a vehicle for heterologous gene expression. The species Saccharomyces cerevisiae has a long history of
industrial use, including its use for heterologous gene expression. Expression of heterologous genes in Saccharomyces
cerevisiae has been reviewed by Goodey et al. (1987, Yeast Biotechnology, D R Berry et al, eds, pp 401-429, Allen
and Unwin, London) and by King et al. (1989, Molecular and Cell Biology of Yeasts, E F Walton and G T Yarronton,
eds, pp 107-133, Blackie, Glasgow).

[0207] For several reasons Saccharomyces cerevisiae is well suited for heterologous gene expression. First, it is
non-pathogenic to humans and it is incapable of producing certain endotoxins. Second, it has a long history of safe
use following centuries of commercial exploitation for various purposes. This has led to wide public acceptability. Third,
the extensive commercial use and research devoted to the organism has resulted in a wealth of knowledge about the
genetics and physiology as well as large-scale fermentation characteristics of Saccharomyces cerevisiae.
[0208] A review of the principles of heterologous gene expression in Saccharomyces cerevisiae and secretion of
gene products is given by E Hinchcliffe and E Kenny ("Yeast as a vehicle for the expression of heterologous genes",
1993, Yeasts, Vol 5, Anthony H Rose and J Stuart Harrison, eds, 2nd edition, Academic Press Ltd.).
[0209] Several types of yeast vectors are available, including integrative vectors, which require recombination with 
the host genome for their maintenance, and autonomously replicating plasmid vectors. 

[0210] In order to prepare the transgenic Saccharomyces, expression constructs are prepared by inserting the
nucleotide sequence of the present invention into a construct designed for expression in yeast.

[0211] Several types of constructs used for heterologous expression have been developed. The constructs contain
a promoter active in yeast fused to the nucleotide sequence of the present invention. Usually a promoter of yeast origin,
such as the GAL1 promoter, is used. Usually a signal sequence of yeast origin, such as the sequence encoding the
SUC2 signal peptide, is used. A terminator active in yeast ends the expression system.

[0212] For the transformation of yeast several transformation protocols have been developed. For example, a
transgenic Saccharomyces according to the present invention can be prepared by following the teachings of Hinnen et al.
(1978, Proceedings of the National Academy of Sciences of the USA, 75: 1929); Beggs, J D (1978, Nature, London,
275:104); and Ito, H et al. (1983, J. Bacteriology 153:163-168).

[0213] The transformed yeast cells are selected using various selective markers. Among the markers used for
transformation are a number of auxotrophic markers such as LEU2, HIS4 and TRP1, and dominant antibiotic resistance
markers such as aminoglycoside antibiotic markers, e.g. G418.

[0214] Thus, the present invention also provides a method of transforming a host cell with a nucleotide sequence 
shown in SEQ ID NO: 1 or a derivative, homologue, variant, analogue or fragment thereof. 

[0215] Host cells transformed with a PFI-001 nucleotide coding sequence may be cultured under conditions suitable
for the expression and recovery of the encoded protein (in cell membranes) from cell culture. The protein produced by
a recombinant cell may be expressed on the cell surface, secreted or may be contained intracellularly depending on
the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
PFI-001 coding sequences will generally enable expression in the cell membrane. Other recombinant constructions
may join PFI-001 coding sequence to nucleotide sequence encoding a polypeptide domain which will facilitate protein
purification/identification (Kroll DJ et al. (1993) DNA Cell Biol Vol 12 p441-53, see also above discussion of vectors
containing fusion proteins).

GENETICALLY ENGINEERED or GENETICALLY MODIFIED 

[0216] A cell, preferably an animal cell, that is "genetically modified" is heterozygous or homozygous for a modification
that is introduced into the cell, or into a progenitor cell, by genetic engineering. The standard methods of genetic
engineering that are available for introducing the modification include homologous recombination, viral vector gene
trapping, irradiation, chemical mutagenesis, and the transgenic expression of a nucleotide sequence encoding
antisense RNA alone or in combination with catalytic ribozymes. Preferred methods for genetic modification are
homologous recombination and viral vector gene trapping which both modify an endogenous gene by inserting a foreign nucleic
acid sequence into the gene locus. A nucleic acid sequence that is foreign to the gene is an exogenous sequence that
is non-naturally occurring in the gene. This insertion of foreign DNA can occur within any region of the PFI-001 gene,
e.g., in an enhancer, promoter, regulator region, non-coding region, coding region, intron, or exon. The most preferred
method of genetic engineering is homologous recombination, in which the foreign nucleic acid sequence is inserted in
a targeted manner either alone or in combination with a deletion of a portion of the endogenous gene sequence.

FUNCTIONALLY DISRUPTED 

[0217] By a PFI-001 gene that is "functionally disrupted" is meant a PFI-001 gene that is genetically modified such
that the cellular activity of the PFI-001 polypeptide encoded by the disrupted gene is decreased in cells that normally
express the wild-type version of the PFI-001 gene. When the genetic modification effectively eliminates all wild-type
copies of the PFI-001 gene in a cell (e.g., the genetically modified cell, preferably an animal cell, is homozygous for
the PFI-001 gene disruption or the only wild-type copy of PFI-001 gene originally present is now disrupted), then the
genetic modification results in a reduction in PFI-001 polypeptide activity (i.e. reduced receptor expression) as compared
to an appropriate control cell that expresses the wild-type PFI-001 gene. This reduction in PFI-001 polypeptide
activity (i.e. reduced receptor expression) results from either reduced PFI-001 gene expression (i.e., PFI-001 mRNA
levels are effectively reduced and produce reduced levels of PFI-001 polypeptide) and/or because the disrupted PFI-001
gene encodes a mutated polypeptide with reduced function or stability as compared to a wild-type PFI-001 polypeptide.
Preferably, the activity (i.e. receptor expression) of PFI-001 polypeptide in the genetically modified cell
is reduced to 50% or less of wild-type levels, more preferably, to 25% or less, and, even more preferably, to 10% or
less of wild-type levels. Most preferably, the PFI-001 gene disruption results in a null mutation.
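
The preference tiers stated above can be illustrated with a short sketch. It assumes that PFI-001 activity has been measured in arbitrary but identical units for the modified cell and the wild-type control; the function name and the tier labels are hypothetical, and the sketch does not model the statistical test that the term "reduced" requires.

```python
def classify_disruption(activity: float, wild_type: float) -> str:
    """Map a measured activity to the preference tiers of the text.

    Both arguments are hypothetical measurements in the same units.
    """
    if wild_type <= 0:
        raise ValueError("wild-type activity must be positive")
    fraction = activity / wild_type
    if fraction <= 0:
        return "null"            # most preferred: no residual activity
    if fraction <= 0.10:
        return "10% or less"     # even more preferred
    if fraction <= 0.25:
        return "25% or less"     # more preferred
    if fraction <= 0.50:
        return "50% or less"     # preferred
    return "above 50%"           # outside the preferred ranges


print(classify_disruption(8.0, 100.0))
```

The thresholds are taken directly from the paragraph above; everything else is an illustrative assumption.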

GENETICALLY MODIFIED ANIMAL CELL

[0218] By a "genetically modified animal cell" containing a functionally disrupted PFI-001 gene is meant an animal
cell, including a human cell, created by genetic engineering to contain a functionally disrupted PFI-001 gene, as well
as daughter cells that inherit the disrupted PFI-001 gene. These cells may be genetically modified in culture according
to any standard method known in the art. As an alternative to genetically modifying the cells in culture, non-human
mammalian cells may also be isolated from a genetically modified, non-human mammal that contains a PFI-001 gene
disruption. The animal cells of the invention may be obtained from primary cell or tissue preparations as well as
culture-adapted, tumorigenic, or transformed cell lines. These cells and cell lines are derived, for example, from endothelial
cells, epithelial cells, islets, neurons and other neural tissue-derived cells, mesothelial cells, osteocytes, lymphocytes,
chondrocytes, hematopoietic cells, immune cells, cells of the major glands or organs (e.g., liver, lung, heart, stomach,
pancreas, kidney, and skin), muscle cells (including cells from skeletal muscle, smooth muscle, and cardiac muscle),
exocrine or endocrine cells, fibroblasts, and embryonic and other totipotent or pluripotent stem cells (e.g., embryonic
stem (ES) cells, ES-like cells, and embryonic germline (EG) cells, and other stem cells, such as progenitor cells and
tissue-derived stem cells). The preferred genetically modified cells are ES cells, more preferably, mouse or rat ES
cells, and, most preferably, human ES cells.

[0219] A "homology region" used in a targeting vector for homologous recombination with a PFI-001 gene is related
(i.e., complementary) to a portion of the PFI-001 gene or a sequence flanking the PFI-001 gene to a degree sufficient
to allow hybridization to occur between the homology region and the PFI-001 gene sequence under standard low
stringency conditions known in the art (e.g., as described in Current Protocols in Human Genetics, unit 4.1, John Wiley
& Sons, New York, NY, 2000).

[0220] By an "ES cell" or an "ES-like cell" is meant a pluripotent stem cell derived from an embryo, from a primordial
germ cell, or from a teratocarcinoma, that is capable of indefinite self renewal as well as differentiation into cell types
that are representative of all three embryonic germ layers.

[0221] By "reduced" is meant a statistically significant decrease (i.e., p<0.1).

[0222] The genetically modified animal cells, including human cells, of the invention are heterozygous or homozygous
for a modification that functionally disrupts the PFI-001 gene. The animal cells may be derived by genetically engineering
cells in culture, or, in the case of non-human mammalian cells, the cells may be isolated from genetically modified,
non-human mammals.

[0223] The PFI-001 gene locus is functionally disrupted by one of the several techniques for genetic modification
known in the art, including chemical mutagenesis (Rinchik, Trends in Genetics 7: 15-21, 1991; Russell, Environmental
& Molecular Mutagenesis 23 (Suppl. 24): 23-29, 1994), irradiation (Russell, supra), transgenic expression of PFI-001
gene antisense RNA, either alone or in combination with a catalytic RNA ribozyme sequence (Luyckx et al., Proc. Natl.
Acad. Sci. 96: 12174-79, 1999; Sokol et al., Transgenic Research 5: 363-71, 1996; Efrat et al., Proc. Natl. Acad. Sci.
USA 91: 2051-55, 1994; Larsson et al., Nucleic Acids Research 22: 2242-48, 1994) and, as further discussed below,
the disruption of the PFI-001 gene by the insertion of a foreign nucleic acid sequence into the PFI-001 gene locus.
Preferably, the foreign sequence is inserted by homologous recombination or by the insertion of a viral vector. Most
preferably, the method of PFI-001 gene disruption is homologous recombination and includes a deletion of a portion
of the endogenous PFI-001 gene sequence.

[0224] The integration of the foreign sequence functionally disrupts the PFI-001 gene through one or more of the
following mechanisms: by interfering with the PFI-001 gene transcription or translation process (e.g., by interfering
with promoter recognition, or by introducing a transcription termination site or a translational stop codon into the
PFI-001 gene); or by distorting the PFI-001 gene coding sequence such that it no longer encodes a PFI-001 polypeptide
with normal receptor function (e.g., by inserting a foreign coding sequence into the PFI-001 gene coding sequence,
by introducing a frameshift mutation or amino acid(s) substitution, or, in the case of a double crossover event, by
deleting a portion of the PFI-001 gene coding sequence that is required for expression of a functional receptor protein).
[0225] To insert a foreign sequence into a PFI-001 gene locus in the genome of a cell, the foreign DNA sequence is
introduced into the cell according to a standard method known in the art such as electroporation, calcium-phosphate
precipitation, retroviral infection, microinjection, biolistics, liposome transfection, DEAE-dextran transfection, or
transferrinfection (see, e.g., Neumann et al., EMBO J. 1: 841-845, 1982; Potter et al., Proc. Natl. Acad. Sci. USA 81: 7161-65,
1984; Chu et al., Nucleic Acids Res. 15: 1311-26, 1987; Thomas and Capecchi, Cell 51: 503-12, 1987; Baum et al.,
Biotechniques 17: 1058-62, 1994; Biewenga et al., J. Neuroscience Methods 71: 67-75, 1997; Zhang et al.,
Biotechniques 15: 868-72, 1993; Ray and Gage, Biotechniques 13: 598-603, 1992; Lo, Mol. Cell. Biol. 3: 1803-14, 1983;
Nickoloff et al., Mol. Biotech. 10: 93-101, 1998; Linney et al., Dev. Biol. (Orlando) 213: 207-16, 1999; Zimmer and
Gruss, Nature 338: 150-153, 1989; and Robertson et al., Nature 323: 445-48, 1986). The preferred method for
introducing foreign DNA into a cell is electroporation.

Homologous recombination

[0226] The method of homologous recombination targets the PFI-001 gene for disruption by introducing a PFI-001
gene targeting vector into a cell containing a PFI-001 gene. The ability of the vector to target the PFI-001 gene for
disruption stems from using a nucleotide sequence in the vector that is homologous to the PFI-001 gene. This homology
region facilitates hybridization between the vector and the endogenous sequence of the PFI-001 gene. Upon
hybridization, the probability of a crossover event between the targeting vector and genomic sequences greatly increases.
This crossover event results in the integration of the vector sequence into the PFI-001 gene locus and the functional
disruption of the PFI-001 gene.

[0227] General principles regarding the construction of vectors used for targeting are reviewed in Bradley et al.
(Biotechnol. 10: 534, 1992). Two different exemplary types of vector can be used to insert DNA by homologous
recombination: an insertion vector or a replacement vector. An insertion vector is circular DNA which contains a region of
PFI-001 gene homology with a double stranded break. Following hybridization between the homology region and the
endogenous PFI-001 gene, a single crossover event at the double stranded break results in the insertion of the entire
vector sequence into the endogenous gene at the site of crossover.

[0228] The more preferred vector to use for homologous recombination is a replacement vector, which is colinear
rather than circular. Replacement vector integration into the PFI-001 gene requires a double crossover event, i.e.
crossing over at two sites of hybridization between the targeting vector and the PFI-001 gene. This double crossover
event results in the integration of vector sequence that is sandwiched between the two sites of crossover into the
PFI-001 gene and the deletion of the corresponding endogenous PFI-001 gene sequence that originally spanned between
the two sites of crossover (see, e.g., Thomas and Capecchi, Cell 51: 503-12, 1987; Mansour et al., Nature 336:
348-52, 1988; Mansour et al., Proc. Natl. Acad. Sci. USA 87: 7688-7692, 1990; and Mansour, GATA 7: 219-227, 1990).
[0229] A region of homology in a targeting vector is generally at least 100 nucleotides in length. Most preferably, the
homology region is at least 1-5 kilobases (Kb) in length. Although there is no demonstrated minimum length or minimum
degree of relatedness required for a homology region, targeting efficiency for homologous recombination generally
corresponds with the length and the degree of relatedness between the targeting vector and the PFI-001 gene locus.
In the case where a replacement vector is used, and a portion of the endogenous PFI-001 gene is deleted upon
homologous recombination, an additional consideration is the size of the deleted portion of the endogenous PFI-001
gene. If this portion of the endogenous PFI-001 gene is greater than 1 Kb in length, then a targeting cassette with
regions of homology that are longer than 1 Kb is recommended to enhance the efficiency of recombination. Further
guidance regarding the selection and use of sequences effective for homologous recombination is described in the
literature (see, e.g., Deng and Capecchi, Mol. Cell. Biol. 12: 3365-3371, 1992; Bollag et al., Annu. Rev. Genet. 23:
199-225, 1989; and Waldman and Liskay, Mol. Cell. Biol. 8: 5350-5357, 1988).
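
The length guidance in the preceding paragraph can be expressed as a simple design check. The sketch below is illustrative only: the function name, the constants and the warning texts are hypothetical, and only the two numeric thresholds (100 nucleotides, 1 Kb) come from the text.

```python
MIN_ARM_NT = 100          # "generally at least 100 nucleotides in length"
LARGE_DELETION_NT = 1000  # deletions above 1 Kb call for arms longer than 1 Kb


def check_homology_arms(arm_lengths_nt, deletion_nt=0):
    """Return a list of warnings for a targeting-vector design (empty if none)."""
    warnings = []
    for i, length in enumerate(arm_lengths_nt, start=1):
        if length < MIN_ARM_NT:
            warnings.append(f"arm {i}: {length} nt is below the general 100 nt minimum")
        elif deletion_nt > LARGE_DELETION_NT and length <= LARGE_DELETION_NT:
            warnings.append(f"arm {i}: deletion exceeds 1 Kb, arms longer than 1 Kb are recommended")
    return warnings


# A replacement vector with 2 Kb and 3 Kb arms that deletes 1.5 Kb raises no warnings.
print(check_homology_arms([2000, 3000], deletion_nt=1500))
```

Because the text states that there is no demonstrated minimum length, the check produces advisory warnings and does not reject a design.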

[0230] A wide variety of cloning vectors may be used as vector backbones in the construction of PFI-001 gene
targeting vectors, including pBluescript-related plasmids (e.g., Bluescript KS+II), pQE70, pQE60, pQE-9, pBS, pD10,
phagescript, phiX174, pBK Phagemid, pNH8A, pNH16a, pNH18Z, pNH46A, ptrc99a, pKK223-3, pKK233-3,
pDR540, and pRIT5, pWLNEO, pSV2CAT, pXT1, pSG (Stratagene), pSVK3, PBPV, PMSG, and pSVL, pBR322
and pBR322-based vectors, pBM9, pBR325, pKH47, pBR328, pHC79, phage Charon 28, pKB11, pKSV-10, pK19
related plasmids, pUC plasmids, and the pGEM series of plasmids. These vectors are available from a variety of
commercial sources (e.g., Boehringer Mannheim Biochemicals, Indianapolis, IN; Qiagen, Valencia, CA; Stratagene,
La Jolla, CA; Promega, Madison, WI; and New England Biolabs, Beverly, MA). However, any other vectors, e.g.
plasmids, viruses, or parts thereof, may be used as long as they are replicable and viable in the desired host. The vector
may also comprise sequences which enable it to replicate in the host whose genome is to be modified. The use of
such a vector can expand the interaction period during which recombination can occur, increasing the efficiency of
targeting (see Molecular Biology, ed. Ausubel et al., Unit 9.16, Fig. 9.16.1).

[0231] The specific host employed for propagating the targeting vectors described above is not critical. Examples
include E. coli K12 RR1 (Bolivar et al., Gene 2: 95, 1977), E. coli K12 HB101 (ATCC No. 33694), E. coli MM21 (ATCC
No. 336780), E. coli DH1 (ATCC No. 33849), E. coli strain DH5α, and E. coli STBL2. Alternatively, hosts such as S.
cerevisiae can be used. The above-mentioned hosts are available commercially (e.g., Stratagene, La Jolla, CA, and
Life Technologies, Rockville, MD).

[0232] To create the targeting vector, a PFI-001 gene targeting construct is added to an above-described vector
backbone. The PFI-001 gene targeting constructs described above have at least one PFI-001 gene homology region.
To make the PFI-001 gene homology regions, a PFI-001 gene-related sequence is used as a basis for producing
polymerase chain reaction (PCR) primers. These primers are used to amplify the desired region of the PFI-001
sequence by high fidelity PCR amplification (Mattila et al., Nucleic Acids Res. 19: 4967, 1991; Eckert and Kunkel 1: 17,
1991; and U.S. Pat. No. 4,683,202). The genomic sequence is obtained from a genomic clone library or from a
preparation of genomic DNA, preferably from the animal species that is to be targeted for PFI-001 gene disruption.
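
As a minimal illustration of deriving primers from a known sequence, the sketch below takes the two ends of a target region as a forward primer and a reverse-complemented reverse primer. The sequence shown is an arbitrary placeholder and not PFI-001 sequence; the function names and the primer length are hypothetical, and practical primer design would also weigh melting temperature, GC content and specificity.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def flanking_primers(region: str, primer_len: int = 20):
    """Return (forward, reverse) primers taken from the ends of the region."""
    if len(region) < 2 * primer_len:
        raise ValueError("region is too short for two non-overlapping primers")
    forward = region[:primer_len]
    reverse = reverse_complement(region[-primer_len:])
    return forward, reverse


# Placeholder sequence for illustration only.
print(flanking_primers("ATGCGTTTTTGGATCC", primer_len=5))
```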
[0233] Preferably, the targeting constructs described above also include an exogenous nucleotide sequence encoding
a positive marker protein. The stable expression of a positive marker after vector integration confers an identifiable
characteristic on the cell without compromising cell viability. Therefore, in the case of a replacement vector, the marker
gene is positioned between two flanking homology regions so that it integrates into the PFI-001 gene following the
double crossover event.

[0234] It is preferred that the positive marker protein is a selectable protein; the stable expression of such a protein
in a cell confers a selectable phenotypic characteristic, i.e., the characteristic enhances the survival of the cell under
otherwise lethal conditions. Thus, by imposing the selectable condition, one can isolate cells that stably express the
positive selectable marker from other cells that have not successfully integrated the vector sequence on the basis of
viability. Examples of positive selectable marker proteins (and their agents of selection) include Neo (G418 or
kanomycin), Hyg (hygromycin), HisD (histidinol), Gpt (xanthine), Ble (bleomycin), and Hprt (hypoxanthine) (see, e.g.,
Capecchi and Thomas, U.S. Pat. No. 5,464,764, and Capecchi, Science 244: 1288-92, 1989). Other positive markers that
may also be used as an alternative to a selectable marker include reporter proteins such as β-galactosidase, firefly
luciferase, or green fluorescent protein (see, e.g., Current Protocols in Cytometry, Unit 9.5, and Current Protocols in
Molecular Biology, Unit 9.6, John Wiley & Sons, New York, NY, 2000).

[0235] The above-described positive selection scheme does not distinguish between cells that have integrated the
vector by targeted homologous recombination at the PFI-001 gene locus versus random, non-homologous integration
of vector sequence into any chromosomal position. Therefore, when using a replacement vector for homologous
recombination, it is also preferred to include a nucleotide sequence encoding a negative selectable marker protein.
Expression of a negative selectable marker causes a cell expressing the marker to lose viability when exposed to a
certain agent (i.e., the marker protein becomes lethal to the cell under certain selectable conditions). Examples of
negative selectable markers (and their agents of lethality) include herpes simplex virus thymidine kinase (gancyclovir
or 1,2-deoxy-2-fluoro-α-d-arabinofuransyl-5-iodouracil), Hprt (6-thioguanine or 6-thioxanthine), and diphtheria toxin,
ricin toxin, and cytosine deaminase (5-fluorocytosine).

[0236] The nucleotide sequence encoding the negative selectable marker is positioned outside of the two homology
regions of the replacement vector. Given this positioning, cells will only integrate and stably express the negative
selectable marker if integration occurs by random, non-homologous recombination; homologous recombination
between the PFI-001 gene and the two regions of homology in the targeting construct excludes the sequence encoding
the negative selectable marker from integration. Thus, by imposing the negative condition, cells that have integrated
the targeting vector by random, non-homologous recombination lose viability.

[0237] The above-described combination of positive and negative selectable markers is preferred because a series
of positive and negative selection steps can be designed to more efficiently select only those cells that have undergone
vector integration by homologous recombination, and therefore, have a potentially disrupted PFI-001 gene. Further
examples of positive-negative selection schemes, selectable markers, and targeting constructs are described, for
example, in U.S. Pat. No. 5,464,764, WO 94/06908, and Valancius and Smithies, Mol. Cell. Biol. 11: 1402, 1991.
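
The logic of the positive-negative selection scheme can be summarised in a small model. This is a sketch under simplifying assumptions: each cell is described only by which markers it stably expresses, the class and function names are hypothetical, and real selection is an enrichment rather than a guarantee (hence "potentially disrupted" above).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    has_positive_marker: bool  # e.g. Neo, located between the homology regions
    has_negative_marker: bool  # e.g. HSV thymidine kinase, outside the homology regions


def survives(cell: Cell, positive_agent: bool = True, negative_agent: bool = True) -> bool:
    """Return True if the cell remains viable under the imposed selection."""
    if positive_agent and not cell.has_positive_marker:
        return False  # killed by the positive selection agent (e.g. G418)
    if negative_agent and cell.has_negative_marker:
        return False  # killed by the negative selection agent (e.g. gancyclovir)
    return True


untransfected = Cell(False, False)
random_integrant = Cell(True, True)         # whole vector integrated at random
homologous_recombinant = Cell(True, False)  # negative marker excluded by the double crossover

for name, cell in [("untransfected", untransfected),
                   ("random integrant", random_integrant),
                   ("homologous recombinant", homologous_recombinant)]:
    print(name, survives(cell))
```

Only the homologous recombinant survives when both agents are applied; with positive selection alone the random integrant survives as well, which is the limitation the negative marker addresses.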

[0238] In order for a marker protein to be stably expressed upon vector integration, the targeting vector may be
designed so that the marker coding sequence is operably linked to the endogenous PFI-001 gene promoter upon
vector integration. Expression of the marker is then driven by the PFI-001 gene promoter in cells that normally express
PFI-001 gene. Alternatively, each marker in the targeting construct of the vector may contain its own promoter that
drives expression independent of the PFI-001 gene promoter. This latter scheme has the advantage of allowing for
expression of markers in cells that do not typically express the PFI-001 gene (Smith and Berg, Cold Spring Harbor
Symp. Quant. Biol. 49: 171, 1984; Sedivy and Sharp, Proc. Natl. Acad. Sci. (USA) 86: 227, 1989; Thomas and Capecchi,
Cell 51: 503, 1987).

[0239] Exogenous promoters that can be used to drive marker gene expression include cell-specific or stage-specific
promoters, constitutive promoters, and inducible or regulatable promoters. Nonlimiting examples of these promoters
include the herpes simplex thymidine kinase promoter, cytomegalovirus (CMV) promoter/enhancer, SV40 promoters,
PGK promoter, PMC1-neo, metallothionein promoter, adenovirus late promoter, vaccinia virus 7.5K promoter, avian
beta globin promoter, histone promoters (e.g., mouse histone H3-614), beta actin promoter, neuron-specific enolase,
muscle actin promoter, and the cauliflower mosaic virus 35S promoter (see generally, Sambrook et al., Molecular
Cloning, Vols. I-III, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989, and Current Protocols in
Molecular Biology, John Wiley & Sons, New York, NY, 2000; Stratagene, La Jolla, CA).

[0240] To confirm whether cells have integrated the vector sequence into the targeted PFI-001 gene locus, primers
or genomic probes that are specific for the desired vector integration event can be used in combination with PCR or
Southern blot analysis to identify the presence of the desired vector integration into the PFI-001 gene locus (Erlich et
al., Science 252: 1643-51, 1991; Zimmer and Gruss, Nature 338: 150, 1989; Mouellic et al., Proc. Natl. Acad. Sci.
(USA) 87: 4712, 1990; and Shesely et al., Proc. Natl. Acad. Sci. (USA) 88: 4294, 1991).

Gene trapping

[0241] Another method available for inserting a foreign nucleic acid sequence into the PFI-001 gene locus to
functionally disrupt the PFI-001 gene is gene trapping. This method takes advantage of the cellular machinery present in
all mammalian cells that splices exons into mRNA to insert a gene trap vector coding sequence into a gene in a random
fashion. Once inserted, the gene trap vector creates a mutation that may functionally disrupt the trapped PFI-001 gene.
In contrast to homologous recombination, this system for mutagenesis creates largely random mutations. Thus, to
obtain a genetically modified cell that contains a functionally disrupted PFI-001 gene, cells containing this particular
mutation must be identified and selected from a pool of cells that contain random mutations in a variety of genes.

[0242] Gene trapping systems and vectors have been described for use in genetically modifying murine cells and
other cell types (see, e.g., Allen et al., Nature 333: 852-55, 1988; Bellen et al., Genes Dev. 3: 1288-1300, 1989; Bier
et al., Genes Dev. 3: 1273-1287, 1989; Bonnerot et al., J. Virol. 66: 4982-91, 1992; Brenner et al., Proc. Nat. Acad.
Sci. USA 86: 5517-21, 1989; Chang et al., Virology 193: 737-47, 1993; Friedrich and Soriano, Methods Enzymol. 225:
681-701, 1993; Friedrich and Soriano, Genes Dev. 5: 1513-23, 1991; Goff, Methods Enzymol. 152: 469-81, 1987;
Gossler et al., Science 244: 463-65, 1989; Hope, Develop. 113: 399-408, 1991; Kerr et al., Cold Spring Harb. Symp.
Quant. Biol. 2: 767-776, 1989; Reddy et al., J. Virol. 65: 1507-1515, 1991; Reddy et al., Proc. Natl. Acad. Sci. U.S.A.
89: 6721-25, 1992; Skarnes et al., Genes Dev. 6: 903-918, 1992; von Melchner and Ruley, J. Virol. 63: 3227-3233,
1989; and Yoshida et al., Transgen. Res. 4: 277-87, 1995).

[0243] Promoter trap (5' trap) vectors contain, in 5' to 3' order, a splice acceptor sequence followed by an exon, which
is typically characterized by a translation initiation codon and open reading frame (ORF) and/or an internal ribosome
entry site. In general, these promoter trap vectors do not contain promoters or operably linked splice donor sequences.
Consequently, after integration into the cellular genome of the host cell, the promoter trap vector sequence intercepts
the normal splicing of the upstream gene and acts as a terminal exon. Expression of the vector coding sequence is
dependent upon the vector integrating into an intron of the disrupted gene in the proper reading frame. In such a case,
the cellular splicing machinery splices exons from the trapped gene upstream of the vector coding sequence
(Zambrowicz et al., WO 99/50426).

[0244] An alternative method for producing an effect similar to the above-described promoter trap vector is a vector 
that incorporates a nested set of stop codons present in, or otherwise engineered into, the region between the splice 
acceptor of the promoter trap vector and the translation initiation codon or polyadenylation sequence. The coding 
sequence can also be engineered to contain an independent ribosome entry site (IRES) so that the coding sequence
will be expressed in a manner largely independent of the site of integration within the host cell genome. Typically, but 
not necessarily, an IRES is used in conjunction with a nested set of stop codons. 

[0245] Another type of gene trapping scheme uses a 3' gene trap vector. This type of vector contains, in operative
combination, a promoter region, which mediates expression of an adjoining coding sequence, the coding sequence,
and a splice donor sequence that defines the 3' end of the coding sequence exon. After integration into a host cell
genome, the transcript expressed by the vector promoter is spliced to a splice acceptor sequence from the trapped
gene that is located downstream of the integrated gene trap vector sequence. Thus, the integration of the vector results
in the expression of a fusion transcript comprising the coding sequence of the 3' gene trap cassette and any downstream
cellular exons, including the terminal exon and its polyadenylation signal. When such vectors integrate into a gene,
the cellular splicing machinery splices the vector coding sequence upstream of the 3' exons of the trapped gene. One
advantage of such vectors is that the expression of the 3' gene trap vectors is driven by a promoter within the gene
trap cassette and does not require integration into a gene that is normally expressed in the host cell (Zambrowicz et
al., WO 99/50426). Examples of transcriptional promoters and enhancers that may be incorporated into the 3' gene
trap vector include those discussed above with respect to targeting vectors.

[0246] The viral vector backbone used as the structural component for the promoter or 3' gene trap vector may be
selected from a wide range of vectors that can be inserted into the genome of a target cell. Suitable backbone vectors
include, but are not limited to, herpes simplex virus vectors, adenovirus vectors, adeno-associated virus vectors,
retroviral vectors, lentiviral vectors, pseudorabies virus, alpha-herpes virus vectors, and the like. A thorough review of
viral vectors, in particular, viral vectors suitable for modifying non-replicating cells and how to use such vectors in
conjunction with the expression of an exogenous polynucleotide sequence, can be found in Viral Vectors: Gene Therapy
and Neuroscience Applications, Eds. Kaplitt and Loewy, Academic Press, San Diego, 1995.

[0247] Preferably, retroviral vectors are used for gene trapping. These vectors can be used in conjunction with
retroviral packaging cell lines such as those described in U.S. Patent No. 5,449,614. Where non-murine mammalian cells
are used as target cells for genetic modification, amphotropic or pantropic packaging cell lines can be used to package
suitable vectors (Ory et al., Proc. Natl. Acad. Sci., USA 93: 11400-11406, 1996). Representative retroviral vectors that
can be adapted to create the presently described 3' gene trap vectors are described, for example, in U.S. Pat. No.
5,521,076.

[0248] The gene trapping vectors may contain one or more of the positive marker genes discussed above with respect
to targeting vectors used for homologous recombination. Similar to their use in targeting vectors, these positive markers
are used in gene trapping vectors to identify and select cells that have integrated the vector into the cell genome. The
marker gene may be engineered to contain an independent ribosome entry site (IRES) so that the marker will be
expressed in a manner largely independent of the location in which the vector has integrated into the target cell genome.

[0249] Given that gene trap vectors will integrate into the genome of infected host cells in a fairly random manner,
a genetically modified cell having a disrupted PFI-001 gene must be identified from a population of cells that have
undergone random vector integration. Preferably, the genetic modifications in the population of cells are of sufficient
randomness and frequency such that the population represents mutations in essentially every gene found in the cell's
genome, making it likely that a cell with a disrupted PFI-001 gene will be identified from the population (see Zambrowicz
et al., WO 99/50426; Sands et al., WO 98/14614).

[0250] Individual mutant cell lines containing a disrupted PFI-001 gene are identified in a population of mutated cells
using, for example, reverse transcription and PCR (RT-PCR) to identify a mutation in a PFI-001 gene sequence. This
process can be streamlined by pooling clones. For example, to find an individual clone containing a disrupted PFI-001
gene, RT-PCR is performed using one primer anchored in the gene trap vector and the other primer located in the
PFI-001 gene sequence. A positive RT-PCR result indicates that the vector sequence is encoded in the PFI-001 gene
transcript, indicating that PFI-001 gene has been disrupted by a gene trap integration event (see, e.g., Sands et al.,
WO 98/14614).
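
One way to picture how pooling streamlines the screen is the two-stage sketch below: each pool is assayed once, and individual clones are assayed only within pools that score positive. The pooling layout, the function name and the `is_positive` callback (standing in for an RT-PCR result) are assumptions made for illustration and are not taken from the cited references.

```python
def screen_by_pooling(clones, is_positive, pool_size):
    """Return (positive_clones, number_of_assays) for a two-stage pooled screen."""
    assays = 0
    hits = []
    for start in range(0, len(clones), pool_size):
        pool = clones[start:start + pool_size]
        assays += 1  # one RT-PCR on the pooled material
        if any(is_positive(clone) for clone in pool):
            for clone in pool:
                assays += 1  # one RT-PCR per clone in a positive pool
                if is_positive(clone):
                    hits.append(clone)
    return hits, assays


# 96 hypothetical clones, one of which carries the trapped PFI-001 gene.
clones = [f"clone_{i}" for i in range(96)]
hits, assays = screen_by_pooling(clones, lambda c: c == "clone_42", pool_size=8)
print(hits, assays)  # 12 pool assays plus 8 single-clone assays
```

In this example 20 assays replace the 96 that would be needed to test every clone individually; the saving shrinks as positive clones become more frequent.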

Temporal, spatial, and inducible gene disruptions 

[0251] A functional disruption of the endogenous PFI-001 gene can occur at specific developmental or cell cycle
stages (temporal disruption) or in specific cell types (spatial disruption). The PFI-001 gene disruption can also be
inducible when certain conditions are present. A recombinase excision system, such as a Cre-Lox system, may be
used to activate or inactivate the PFI-001 gene at a specific developmental stage, in a particular tissue or cell type, or
under particular environmental conditions. Generally, methods utilizing Cre-Lox technology are carried out as described
by Torres and Kuhn, Laboratory Protocols for Conditional Gene Targeting, Oxford University Press, 1997. Methodology
similar to that described for the Cre-Lox system can also be employed utilizing the FLP-FRT system. Further guidance
regarding the use of recombinase excision systems for conditionally disrupting genes by homologous recombination
or viral insertion is provided, for example, in U.S. Pat. No. 5,626,159, U.S. Pat. No. 5,527,695, U.S. Pat. No. 5,434,066,
WO 98/29533, Orban et al., Proc. Nat. Acad. Sci. USA 89: 6861-65, 1992; O'Gorman et al., Science 251: 1351-55,
1991; Sauer et al., Nucleic Acids Research 17: 147-61, 1989; Barinaga, Science 265: 26-28, 1994; and Akagi et al.,
Nucleic Acids Res. 25: 1766-73, 1997. More than one recombinase system can be used to genetically modify an animal
cell.

[0252] When using homologous recombination to disrupt the PFI-001 gene in a temporal, spatial, or inducible fashion,
using a recombinase system such as the Cre-Lox system, a portion of the PFI-001 gene coding region is replaced by
a targeting construct comprising the PFI-001 gene coding region flanked by loxP sites. Animal cells carrying this genetic
modification contain a functional, loxP-flanked PFI-001 gene. The temporal, spatial, or inducible aspect of the PFI-001
gene disruption is caused by the expression pattern of an additional transgene, a Cre recombinase transgene, that is
expressed in the animal cell under the control of the desired spatially-regulated, temporally-regulated, or inducible
promoter, respectively. A Cre recombinase targets the loxP sites for recombination. Therefore, when Cre expression
is activated, the loxP sites undergo recombination to excise the sandwiched PFI-001 gene coding sequence, resulting
in a functional disruption of the PFI-001 gene (Rajewski et al., J. Clin. Invest. 98: 600-03, 1996; St.-Onge et al., Nucleic
Acids Res. 24: 3875-77, 1996; Agah et al., J. Clin. Invest. 100: 159-79, 1997; Brocard et al., Proc. Natl. Acad. Sci.
USA 94: 14559-63, 1997; Feil et al., Proc. Natl. Acad. Sci. USA 93: 10887-90, 1996; and Kühn et al., Science 269:
1427-29, 1995).
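
The excision step described above can be modelled on a simplified representation of the locus. The sketch assumes the locus is an ordered list of named elements and that the two loxP sites are in the same orientation (directly repeated), in which case recombination removes the intervening sequence and leaves a single loxP site; the element names and the function name are hypothetical.

```python
def cre_excise(locus, cre_expressed):
    """Return the locus after Cre-mediated excision between the first two loxP sites.

    Assumes directly repeated loxP sites; the input list is not modified.
    """
    if not cre_expressed:
        return list(locus)
    sites = [i for i, element in enumerate(locus) if element == "loxP"]
    if len(sites) < 2:
        return list(locus)  # nothing to recombine
    first, second = sites[0], sites[1]
    # Keep everything up to and including the first loxP, then everything
    # after the second loxP; one loxP site remains at the junction.
    return locus[:first + 1] + locus[second + 1:]


floxed = ["promoter", "loxP", "PFI-001 coding region", "loxP", "3' flank"]
print(cre_excise(floxed, cre_expressed=False))  # functional, loxP-flanked gene
print(cre_excise(floxed, cre_expressed=True))   # coding region excised
```

In this model the choice of promoter driving Cre is represented only by the `cre_expressed` flag; when and where that flag becomes true is what makes the disruption temporal, spatial, or inducible.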

[0253] A cell containing both a Cre recombinase transgene and loxP-flanked PFI-001 gene can be generated through
standard transgenic techniques. Further guidance regarding the use of recombinase systems and specific promoters to
temporally, spatially, or conditionally disrupt the PFI-001 gene is found, for example, in Sauer, Meth. Enz. 225: 890-900,
1993; Gu et al., Science 255: 103-06, 1994; Araki et al., J. Biochem. 122: 977-82, 1997; Dymecki, Proc. Natl. Acad.
Sci. 93: 6191-96, 1996; and Meyers et al., Nature Genetics 18: 136-41, 1998.

[0254] An inducible disruption of the PFI-001 gene can also be achieved by using a tetracycline responsive binary
system (Gossen and Bujard, Proc. Natl. Acad. Sci. USA 89: 5547-51, 1992). This system involves genetically modifying
a cell to introduce a Tet promoter into the endogenous PFI-001 gene regulatory element and a transgene expressing
a tetracycline-controllable repressor (TetR). In such a cell, the administration of tetracycline activates the TetR which,
in turn, inhibits PFI-001 gene expression and, therefore, functionally disrupts the PFI-001 gene (St.-Onge et al., Nucleic
Acids Res. 24: 3875-77, 1996; U.S. Patent No. 5,922,927).

[0255] The above-described systems for temporal, spatial, and inducible disruptions of the PFI-001 gene can also 
be adopted when using gene trapping as the method of genetic modification, as described, for example, 
in WO 98/29533. 

Creating genetically modified animal cells 

[0256] The above-described methods for genetic modification can be used to functionally disrupt a PFI-001 gene in 
virtually any type of somatic or stem cell derived from an animal. Genetically modified animal cells of the invention 
include, but are not limited to, mammalian cells, including human cells, and avian cells. These cells may be derived 
from genetically engineering any animal cell line, such as culture-adapted, tumorigenic, or transformed cell lines, or 
they may be isolated from a genetically modified, non-human mammal carrying the desired PFI-001 genetic modification. 

[0257] The cells may be heterozygous or homozygous for the disrupted PFI-001 gene. To obtain cells that are 
homozygous for the PFI-001 gene disruption (PFI-001-/-), direct, sequential targeting of both alleles can be performed. 
This process can be facilitated by recycling a positive selectable marker. According to this scheme the nucleotide 
sequence encoding the positive selectable marker is removed following the disruption of one allele using the Cre-LoxP 
system. Thus, the same vector can be used in a subsequent round of targeting to disrupt the second PFI-001 gene 
allele (Abuin and Bradley, Mol. Cell. Biol. 16: 1851-56, 1996; Sedivy et al., T.I.G. 15: 88-90, 1999; Cruz et al., Proc. 
Natl. Acad. Sci. (USA) 88: 7170-74, 1991; Mortensen et al., Proc. Natl. Acad. Sci. (USA) 88: 7036-40, 1991; te Riele 
et al., Nature (London) 348: 649-651, 1990). 

[0258] An alternative strategy for obtaining ES cells that are PFI-001-/- is the homogenotization of cells from a 
population of cells that is heterozygous for the PFI-001 gene disruption (PFI-001+/-). The method uses a scheme in which 
PFI-001+/- targeted clones that express a selectable drug resistance marker are selected against a very high drug 
concentration; this selection favours cells that express two copies of the sequence encoding the drug resistance marker 
and are, therefore, homozygous for the PFI-001 gene disruption (Mortensen et al., Mol. Cell. Biol. 12: 2391-95, 1992). 
[0259] Following the genetic modification of the desired cell or cell line, the PFI-001 gene locus can be confirmed 
as the site of modification by PCR analysis according to standard PCR or Southern blotting methods known in the art 
(see, e.g., U.S. Pat. No. 4,683,202; and Erlich et al., Science 252: 1643, 1991). Further verification of the functional 
disruption of the PFI-001 gene may also be made if PFI-001 gene messenger RNA (mRNA) levels and/or PFI-001 
polypeptide levels are reduced in cells that normally express the PFI-001 gene. Measures of PFI-001 gene mRNA 
levels may be obtained by using reverse transcriptase mediated polymerase chain reaction (RT-PCR), Northern blot 
analysis, or in situ hybridization. The quantification of PFI-001 polypeptide levels produced by the cells can be made, 
for example, by standard immunoassay methods known in the art. Such immunoassays include, but are not limited to, 
competitive and non-competitive assay systems using techniques such as radioimmunoassays, ELISA (enzyme-linked 
immunosorbent assay), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using colloidal gold, enzymatic, or radioisotope labels, for example), 
western blots, 2-dimensional gel analysis, precipitation reactions, immunofluorescence assays, protein A assays, and 
immunoelectrophoresis assays. 

[0260] Preferred genetically modified animal cells are embryonic stem (ES) cells and ES-like cells. These cells are 
derived from the preimplantation embryos and blastocysts of various species, such as mice (Evans et al., Nature 129: 
154-156, 1981; Martin, Proc. Natl. Acad. Sci. USA 78: 7634-7638, 1981), pigs and sheep (Notarianni et al., J. Reprod. 
Fert. Suppl. 43: 255-260, 1991; Campbell et al., Nature 380: 64-68, 1996) and primates, including humans (Thomson 
et al., U.S. Patent No. 5,843,780; Thomson et al., Science 282: 1145-1147, 1995; and Thomson et al., Proc. Natl. Acad. 
Sci. USA 92: 7844-7848, 1995). 

[0261] These types of cells are pluripotent. That is, under proper conditions, they differentiate into a wide variety of 
cell types derived from all three embryonic germ layers: ectoderm, mesoderm and endoderm. Depending upon the 
culture conditions, a sample of ES cells can be cultured indefinitely as stem cells, allowed to differentiate into a wide 
variety of different cell types within a single sample, or directed to differentiate into a specific cell type, such as 
macrophage-like cells, neuronal cells, cardiomyocytes, adipocytes, smooth muscle cells, endothelial cells, skeletal muscle 
cells, keratinocytes, and hematopoietic cells, such as eosinophils, mast cells, erythroid progenitor cells, or 
megakaryocytes. Directed differentiation is accomplished by including specific growth factors or matrix components in the culture 
conditions, as further described, for example, in Keller et al., Curr. Opin. Cell Biol. 7: 862-69, 1995, Li et al., Curr. Biol. 
8: 971, 1998, Klug et al., J. Clin. Invest. 98: 216-24, 1996, Lieschke et al., Exp. Hematol. 23: 328-34, 1995, Yamane 
et al., Blood 90: 3515-23, 1997, and Hirashima et al., Blood 93: 1253-63, 1999. 

[0262] The particular embryonic stem cell line that is used for genetic modification is not critical; exemplary murine 
ES cell lines include AB-1 (McMahon and Bradley, Cell 62: 1073-85, 1990), E14 (Hooper et al., Nature 326: 292-95, 
1987), D3 (Doetschman et al., J. Embryol. Exp. Morph. 87: 27-45, 1985), CCE (Robertson et al., Nature 323: 445-48, 
1986), RW4 (Genome Systems, St. Louis, MO), and DBA/1lacJ (Roach et al., Exp. Cell Res. 221: 520-25, 1995). 

PRODUCTION OF THE POLYPEPTIDE 

[0263] According to the present invention, the production of the polypeptide of the present invention can be effected 
by the culturing of eukaryotic or prokaryotic expression hosts, which have been transformed with one or more 
polynucleotides of the present invention, in a conventional nutrient fermentation medium. The selection of the appropriate 
medium may be based on the choice of expression hosts and/or based on the regulatory requirements of the expression 
construct. Such media are well-known to those skilled in the art. The medium may, if desired, contain additional 
components favouring the transformed expression hosts over other potentially contaminating micro-organisms. 
[0264] Thus, the present invention also provides a method for producing a polypeptide having PFI-001 activity, the 
method comprising the steps of (a) transforming a host cell with a nucleotide sequence shown in SEQ ID NO: 1 or a 
derivative, homologue, variant, analogue or fragment thereof; and (b) culturing the transformed host cell under 
conditions suitable for the expression of said polypeptide. 

[0265] The present invention also relates to a method for producing a polypeptide having PFI-001 activity, the method 
comprising the steps of (a) culturing a host cell that has been transformed with a nucleotide sequence shown in SEQ 
ID NO: 1 or a derivative, homologue, variant, analogue or fragment thereof under conditions suitable for the expression 
of said polypeptide; and (b) recovering said polypeptide from the host cell culture. 

[0266] The present invention also relates to a method for producing a polypeptide having PFI-001 activity, the method 
comprising the steps of (a) transforming a host cell with a nucleotide sequence shown in SEQ ID NO: 1 or a derivative, 
homologue, variant, analogue or fragment thereof; (b) culturing the transformed host cell under conditions suitable for 
the expression of said polypeptide; and (c) recovering said polypeptide from the host cell culture. 

RIBOZYMES 

[0267] Ribozymes are enzymatic RNA molecules capable of catalysing the specific cleavage of RNA. The mechanism 
of ribozyme action involves sequence specific hybridisation of the ribozyme molecule to complementary target RNA, 
followed by endonucleolytic cleavage. Within the scope of the invention are engineered hammerhead motif ribozyme 
molecules that specifically and efficiently catalyse endonucleolytic cleavage of PFI-001 RNA sequences. 
[0268] Specific ribozyme cleavage sites within any potential RNA target are initially identified by scanning the target 
molecule for ribozyme cleavage sites which include the following sequences: GUA, GUU and GUC. Once identified, 
short RNA sequences of between 15 and 20 ribonucleotides corresponding to the region of the target gene containing 
the cleavage site may be evaluated for secondary structural features which may render the oligonucleotide sequence 
inoperable. The suitability of candidate targets may also be evaluated by testing accessibility to hybridisation with 
complementary oligonucleotides using ribonuclease protection assays. 
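
By way of illustration only, the initial scanning step described above might be sketched as follows. The function names, the default window length of 17 ribonucleotides and the placement of the window around the site are assumptions made for this sketch; evaluation of secondary structure and of accessibility to hybridisation is not modelled.

```python
# Illustrative sketch only: names and the default window length are
# assumptions, not part of the described method.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def normalise(seq):
    """Upper-case the sequence and read any DNA-style T as U."""
    return seq.upper().replace("T", "U")


def find_cleavage_sites(rna):
    """Return (index, triplet) for every GUA, GUU or GUC in the target."""
    rna = normalise(rna)
    return [
        (i, rna[i:i + 3])
        for i in range(len(rna) - 2)
        if rna[i:i + 3] in CLEAVAGE_TRIPLETS
    ]


def target_window(rna, site, length=17):
    """Return a 15 to 20 nt stretch of the target containing the site.

    The window is roughly centred on the triplet and shifted inwards
    when the site lies close to either end of the sequence.
    """
    if not 15 <= length <= 20:
        raise ValueError("window length must be between 15 and 20")
    rna = normalise(rna)
    start = site + 1 - length // 2
    start = max(0, min(start, len(rna) - length))
    return rna[start:start + length]
```

Each window returned in this way is only a candidate; it would still need to be assessed for secondary structure and accessibility as set out above.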

[0269] Both antisense RNA and DNA molecules and ribozymes of the invention may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically synthesising oligonu- 
cleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated 
by in vitro or in vivo transcription of DNA sequences encoding the antisense RNA molecule. Such DNA sequences 
may be incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or SP6. Alter- 
natively, antisense cDNA constructs that synthesize antisense RNA constitutively or inducibly can be introduced into 
cell lines, cells or tissues. 
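
As a minimal sketch, the antisense RNA corresponding to a stretch of sense-strand sequence can be derived as its reverse complement. The function name and the short input used in the usage note are hypothetical placeholders and are not taken from SEQ ID NO: 1.

```python
# Illustrative sketch only: the function name is an assumption.
# Maps each sense-strand base (DNA or RNA alphabet) to its RNA complement.
_RNA_COMPLEMENT = {"A": "U", "T": "A", "U": "A", "G": "C", "C": "G"}


def antisense_rna(sense):
    """Return the antisense RNA (5' to 3') for a sense DNA or RNA sequence."""
    try:
        return "".join(_RNA_COMPLEMENT[base] for base in reversed(sense.upper()))
    except KeyError as err:
        raise ValueError(f"unexpected base: {err.args[0]}") from None
```

For example, the placeholder sense sequence ATGGCC gives the antisense RNA GGCCAU.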

DETECTION 

[0270] The presence of the PFI-001 polynucleotide coding sequence can be detected by DNA-DNA or DNA-RNA 
hybridisation or amplification using probes, portions or fragments of the sequence presented in SEQ ID NO: 1. Nucleic 
acid amplification-based assays involve the use of oligonucleotides or oligomers based on the PFI-001 coding 
sequence to detect transformants containing PFI-001 DNA or RNA. As used herein "oligonucleotides" or "oligomers" may 
refer to a nucleic acid sequence of at least about 10 nucleotides and as many as about 60 nucleotides, preferably 
about 15 to 30 nucleotides, and more preferably about 20 to 25 nucleotides which can be used as a probe or amplimer. 
Preferably, oligonucleotides are derived from the 3' region of the nucleotide sequence shown in SEQ ID NO: 1. 

[0271] A variety of protocols for detecting and measuring the expression of PFI-001 polypeptide, such as by using 
either polyclonal or monoclonal antibodies specific for the protein, are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS). A 
two-site, monoclonal-based immunoassay utilising monoclonal antibodies reactive to two non-interfering epitopes on a 
PFI-001 polypeptide is preferred, but a competitive binding assay may also be employed. These and other assays are 
described, among other places, in Hampton R et al. (1990, Serological Methods, A Laboratory Manual, APS Press, St 
Paul, MN, USA) and Maddox DE et al. (1983, J Exp Med 158:1211). 

[0272] A wide variety of labels and conjugation techniques are known by those skilled in the art and can be used in 
various nucleic and amino acid assays. Means for producing labelled hybridisation or PCR probes for detecting 
PFI-001 polynucleotide sequences include oligolabelling, nick translation, end-labelling or PCR amplification using a 
labelled nucleotide. Alternatively, the PFI-001 coding sequence, or any portion of it, may be cloned into a vector for the 
production of an mRNA probe. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3 or SP6 and labelled 
nucleotides. 

[0273] A number of companies such as Pharmacia Biotech (Piscataway, NJ, USA), Promega (Madison, WI, USA), 
and US Biochemical Corporation (Cleveland, OH, USA) supply commercial kits and protocols for these procedures. 
Suitable reporter molecules or labels include those radionuclides, enzymes, fluorescent, chemiluminescent, or 
chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles and the like. Patents teaching the use 
of such labels include US-A-3817837; US-A-3850752; US-A-3939350; US-A-3996345; US-A-4277437; US-A-4275149 
and US-A-4366241. Also, recombinant immunoglobulins may be produced as shown in US-A-4816567. 
[0274] Additional methods to quantify the expression of a particular molecule include radiolabelling (Melby PC et al., 
1993, J. Immunol Methods Vol 159 p235-44) or biotinylating (Duplaa C et al., 1993, Anal Biochem Vol 229 p36) 
nucleotides, co-amplification of a control nucleic acid, and standard curves onto which the experimental results are 
interpolated. Quantification of multiple samples may be speeded up by running the assay in an ELISA format where the 
oligomer of interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantification. 
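
Interpolation of an experimental result onto a standard curve may be sketched as below. The sketch assumes, purely for illustration, that the signal rises or falls steadily with concentration and that a straight line between adjacent standards is an adequate approximation; the function name and the figures in the usage note are hypothetical.

```python
# Illustrative sketch only: piecewise-linear interpolation between
# adjacent standards is an assumption, not a prescribed curve model.
def interpolate_concentration(standards, signal):
    """Estimate a concentration from a measured signal.

    standards: iterable of (concentration, signal) pairs from the
               dilution series of the control.
    signal:    the measured response of the experimental sample.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    for (conc_lo, sig_lo), (conc_hi, sig_hi) in zip(points, points[1:]):
        if sig_lo <= signal <= sig_hi:
            if sig_hi == sig_lo:
                # Two standards gave the same reading: report their midpoint.
                return (conc_lo + conc_hi) / 2
            fraction = (signal - sig_lo) / (sig_hi - sig_lo)
            return conc_lo + fraction * (conc_hi - conc_lo)
    raise ValueError("signal lies outside the range of the standard curve")
```

With hypothetical standards of 0, 10 and 20 units reading 0.05, 0.25 and 0.45 respectively, a sample reading of 0.35 interpolates to approximately 15 units. A reading outside the range of the standards is rejected rather than extrapolated.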

[0275] Although the presence/absence of marker gene expression suggests that the gene of interest is also present, 
its presence and expression should be confirmed. For example, if the PFI-001 coding sequence is inserted within a 
marker gene sequence, recombinant cells containing PFI-001 coding regions can be identified by the absence of 
marker gene function. Alternatively, a marker gene can be placed in tandem with a PFI-001 coding sequence under 
the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of PFI-001 as well. 

[0276] Alternatively, host cells which contain the coding sequence for PFI-001 and express PFI-001 coding regions 
may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not 
limited to, DNA-DNA or DNA-RNA hybridisation and protein bioassay or immunoassay techniques which include 
membrane-based, solution-based, or chip-based technologies for the detection and/or quantification of the nucleic acid or 
protein. 


ANTIBODIES 

[0277] The amino acid sequence of the present invention can also be used to generate antibodies - such as by use 
of standard techniques - against the amino acid sequence. 

[0278] Procedures well known in the art may be used for the production of antibodies to PFI-001 polypeptides. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments 
produced by a Fab expression library. Neutralising antibodies, i.e. those which antagonise biological activity of PFI- 
001 polypeptides, are especially preferred for diagnostics and therapeutics. 

[0279] For the production of antibodies, various hosts including goats, rabbits, rats, mice, etc. may be immunised 
by injection with the PFI-001 polypeptide or any portion, variant, homologue, fragment, analogue or derivative thereof 
or oligopeptide which retains immunogenic properties. Depending on the host species, various adjuvants may be used 
to increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels such as 
aluminium hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, and dinitrophenol. BCG (Bacilli Calmette-Guerin) and Corynebacterium 
parvum are potentially useful human adjuvants which may be employed. 

[0280] Monoclonal antibodies to the amino acid sequence may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the 
hybridoma technique originally described by Koehler and Milstein (1975, Nature Vol 256 p495-497), the human B-cell 
hybridoma technique (Kosbor et al. (1983) Immunol Today Vol 4 p72; Cote et al. (1983) Proceedings of the National 
Academy of Sciences (USA) Vol 80 p2026-2030) and the EBV-hybridoma technique (Cole et al. (1985) Monoclonal 
Antibodies and Cancer Therapy, Alan R Liss Inc, pp. 77-96). In addition, techniques developed for the production of 
"chimeric antibodies", the splicing of mouse antibody genes to human antibody genes to obtain a molecule with 
appropriate antigen specificity and biological activity can be used (Morrison et al. (1984) Proceedings of the National 
Academy of Sciences (USA) Vol 81 p6851-6855; Neuberger et al. (1984) Nature Vol 312 p604-608; Takeda et al. (1985) 
Nature Vol 314 p452-454). Alternatively, techniques described for the production of single chain antibodies 
(US-A-4946779) can be adapted to produce polypeptide-specific single chain antibodies. 

[0281] Antibodies may also be produced by inducing in vivo production in the lymphocyte population or by screening 
recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in Orlandi et al. (1989, 
Proceedings of the National Academy of Sciences (USA) Vol 86 p 3833-3837) and Winter G and Milstein C (1991, 
Nature Vol 349 p293-299). 

[0282] Antibody fragments which contain specific binding sites for PFI-001 may also be generated. For example, 
such fragments include but are not limited to the F(ab')2 fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be generated by reducing the disulphide bridges of the F(ab')2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity (Huse WD et al. (1989) Science Vol 256 p1275-1281). 
[0283] An alternative technique involves screening phage display libraries where, for example, the phage express 
scFv fragments on the surface of their coat with a large variety of complementarity determining regions (CDRs). This 
technique is well known in the art. 

[0284] PFI-001-specific antibodies are useful for the diagnosis of conditions and diseases associated with expression 
of the PFI-001 receptor. A variety of protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such immunoassays typically 
involve the formation of complexes between PFI-001 polypeptide and its specific antibody (or similar PFI-001-binding 
molecule) and the measurement of complex formation. A two-site, monoclonal based immunoassay utilising 
monoclonal antibodies reactive to two, non-interfering epitopes on a specific PFI-001 protein is preferred, but a competitive 
binding assay may also be employed. These assays are described in Maddox DE et al. (1983, Journal of Experimental 
Medicine Vol 158 p1211). 

[0285] Anti-PFI-001 antibodies are useful for the diagnosis of disorders involving abnormal signal transduction or 
other disorders or diseases characterised by abnormal expression of a PFI-001 receptor. Diagnostic assays for 
PFI-001 include methods utilising the antibody and a label to detect a PFI-001 polypeptide in human body fluids, cells, 
tissues or sections or extracts of such tissues. The polypeptides and antibodies of the present invention may be used 
with or without modification. Frequently, the polypeptides and antibodies will be labelled by joining them, either 
covalently or noncovalently, with a reporter molecule. A wide variety of reporter molecules are known to those of skill in the 
art. 

[0286] Antibodies may be used in a method of detecting polypeptides of the invention present in biological samples, 
the method comprising: (a) providing an antibody of the invention; (b) incubating a biological sample with said 
antibody under conditions which allow for the formation of an antibody-antigen complex; and (c) determining whether 
an antibody-antigen complex comprising said antibody is formed. 
[0287] Antibodies of the invention may be bound to a solid support and/or packaged into kits in a suitable container 
along with suitable reagents, controls, instructions and the like. 

ASSAYS/IDENTIFICATION METHODS 

[0288] The present invention also relates to an assay method for detecting the presence of PFI-001 in cells (such 
as human cells) comprising: (a) performing a reverse transcriptase-polymerase chain reaction (RT-PCR) on RNA (such 
as total RNA) from such cells using a pair of PCR primers that are specific for PFI-001, as determined from the DNA 
sequence shown in SEQ ID NO: 1 or an allelic variation thereof; and (b) assaying the appearance of an appropriately 
sized PCR fragment - such as by agarose gel electrophoresis. 
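
What counts as an "appropriately sized" fragment follows from where the two primers anneal on the template. The following is a minimal sketch of that calculation, assuming exact primer matches and using the first forward-primer site found; the function names and the template in the usage note are hypothetical placeholders and are not taken from SEQ ID NO: 1.

```python
# Illustrative sketch only: exact-match primer location on a cDNA
# template; mismatches and multiple priming sites are not modelled.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def expected_amplicon_size(template, forward_primer, reverse_primer):
    """Return the expected product length in base pairs, or None.

    The forward primer is searched for on the given strand; the reverse
    primer anneals to the opposite strand, so its reverse complement is
    searched for downstream of the forward primer.
    """
    template = template.upper()
    forward = forward_primer.upper()
    reverse_site = reverse_complement(reverse_primer)
    start = template.find(forward)
    if start == -1:
        return None
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return end + len(reverse_site) - start
```

For the placeholder template ATGCCGTAAACCCGGGTTTACTGACC with forward primer ATGCCG and reverse primer GGTCAG, the expected product is 26 base pairs, which is the band size that would then be looked for on the gel.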

[0289] There are numerous assays in which the polypeptide of the present invention can be used to screen for 
modulators (e.g. antagonists or agonists) of the polypeptide. Examples of such assays include: 

Functional Assay - One example of a method for screening receptors to identify antagonists thereof is to monitor 
the inhibitory or stimulatory effect on cAMP accumulation or adenylate cyclase activity. Such an assay involves transfecting 
a mammalian cell with the receptor of the present invention for cell surface expression. The cell is then exposed 
to putative antagonists and the amount of cAMP accumulation is measured. If the putative antagonist binds the 
receptor, the levels of receptor-mediated cAMP or adenylate cyclase activity will either increase or decrease. 

Functional Assay using a Fluorometric Imaging Plate Reader (FLIPR) - A technique used for screening that includes 
the use of cells that express the receptor of the present invention (for example transfected HEK293 cells) in a 
system that measures intracellular calcium or extracellular pH changes caused by receptor activation. In this 
technique, cells expressing the receptor of the invention may be contacted with compounds (e.g. small molecules, 
peptides, lipids, nucleotides or glycoproteins) that cause a second messenger response, e.g. signal transduction, 
change in calcium levels or pH changes. These changes are used to determine whether the potential compound 
activates or inhibits the receptor. 

Ligand Binding Assay - This type of assay may test binding of a candidate compound, where adherence to the 
cells containing the receptor of the present invention is detected by means of a label directly or indirectly associated 
with the candidate compound or in an assay involving competition with a labelled competitor. Standard assays for 
conducting screens that determine if the compound activates or inhibits the receptor are well understood by those 
skilled in the art. 
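
For the competition format of the ligand binding assay, one conventional way of expressing the result is the percentage of specific binding of the labelled competitor that is displaced by the candidate compound. The sketch below is illustrative only; the function and parameter names are assumptions, and the three inputs are taken to be label counts measured under otherwise identical conditions.

```python
# Illustrative sketch only: names are assumptions. Specific binding is
# taken as total binding minus non-specific binding.
def percent_displacement(total_bound, nonspecific_bound, bound_with_candidate):
    """Return the percentage of specific labelled-competitor binding
    displaced by the candidate compound.

    total_bound:          label bound with no candidate present
    nonspecific_bound:    label bound in the presence of excess
                          unlabelled competitor
    bound_with_candidate: label bound in the presence of the candidate
    """
    specific_control = total_bound - nonspecific_bound
    if specific_control <= 0:
        raise ValueError("no specific binding in the control")
    specific_with_candidate = bound_with_candidate - nonspecific_bound
    return 100.0 * (1 - specific_with_candidate / specific_control)
```

With hypothetical counts of 1000 (total), 100 (non-specific) and 550 (with candidate), the candidate displaces 50% of specific binding. Displacement alone shows binding and does not by itself indicate whether the compound activates or inhibits the receptor.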



[0290] The present invention therefore also relates to a method of identifying agents (such as compounds, other 
substances or compositions comprising the same) that affect (such as antagonise, agonise or otherwise modify) the 
activity of PFI-001 and/or the expression thereof, the method comprising contacting PFI-001 or the nucleotide sequence 
coding for the same with the agent and then measuring the activity of PFI-001 and/or the expression thereof. 
[0291] The present invention also relates to a method of identifying agents (such as compounds, other substances 
or compositions comprising the same) that selectively affect (such as antagonise, agonise or otherwise modify) the 
activity of PFI-001 and/or the expression thereof, the method comprising contacting PFI-001 or the nucleotide sequence 
coding for the same with the agent and then measuring the activity of PFI-001 and/or the expression thereof. 
[0292] The present invention also relates to a method of identifying agents (such as compounds, other substances 
or compositions comprising the same) that affect (such as antagonise, agonise or otherwise modify) the activity of 
PFI-001 and/or the expression thereof, the method comprising measuring the activity of PFI-001 and/or the expression 
thereof in the presence of the agent or after the addition of the agent in: (a) a cell line into which has been incorporated 
recombinant DNA comprising the DNA sequence shown in SEQ ID NO: 1 or an allelic variation thereof, or (b) a cell 
population or cell line that naturally selectively expresses PFI-001. Preferably, the activity of PFI-001 is determined by 
the assay methods described above. 

[0293] The present invention also relates to a method of identifying agents (such as compounds, other substances 
or compositions comprising the same) that selectively affect (such as antagonise, agonise or otherwise modify) the 
activity of PFI-001 and/or the expression thereof, the method comprising measuring the activity of PFI-001 and/or the 
expression thereof in the presence of the agent or after the addition of the agent in: (a) a cell line into which has been 
incorporated recombinant DNA comprising the DNA sequence shown in SEQ ID NO: 1 or an allelic variation thereof, 
or (b) a cell population or cell line that naturally selectively expresses PFI-001. Preferably, the activity of PFI-001 is 
determined by the assay methods described above. 

[0294] The present invention also relates to a method of screening an agent for modulation (preferably for specific 
modulation) of PFI-001 (or a derivative, homologue, variant, analogue or fragment thereof) activity or the expression 
of the nucleotide sequence coding for the same (including a derivative, homologue, variant, analogue or fragment 
thereof), the method comprising the steps of: (a) providing a candidate agent; (b) combining PFI-001 (or the derivative, 
homologue, variant, analogue or fragment thereof) or the nucleotide sequence coding for the same (or the derivative, 
homologue, variant, analogue or fragment thereof) with the candidate agent for a time sufficient to allow modulation 
under suitable conditions; and (c) detecting modulation of the candidate agent to PFI-001 (or the derivative, homologue, 
variant, analogue or fragment thereof) or the nucleotide sequence coding for the same (or the derivative, homologue, 
variant, analogue or fragment thereof) in order to ascertain if the candidate agent modulates PFI-001 (or the derivative, 
homologue, variant, analogue or fragment thereof) activity or the expression of the nucleotide sequence coding for the 
same (or the derivative, homologue, variant, analogue or fragment thereof). 

[0295] The present invention also relates to a method of screening an agent for specific binding affinity with PFI-001 
(or a derivative, homologue, variant, analogue or fragment thereof) or the nucleotide sequence coding for the same 
(including a derivative, homologue, variant, analogue or fragment thereof), the method comprising the steps of: (a) 
providing a candidate agent; (b) combining PFI-001 (or the derivative, homologue, variant, analogue or fragment 
thereof) or the nucleotide sequence coding for the same (or the derivative, homologue, variant, analogue or fragment thereof) 
with the candidate agent for a time sufficient to allow binding under suitable conditions; and (c) detecting binding of 
the candidate agent to PFI-001 (or the derivative, homologue, variant, analogue or fragment thereof) or the nucleotide 
sequence coding for the same (or the derivative, homologue, variant, analogue or fragment thereof) in order to ascertain 
if the candidate agent binds to PFI-001 (or the derivative, homologue, variant, analogue or fragment thereof) or the 
nucleotide sequence coding for the same (or the derivative, homologue, variant, analogue or fragment thereof). 
[0296] Thus, in certain embodiments of the present invention, PFI-001 or a variant, homologue, fragment, analogue 
or derivative thereof and/or a cell line that expresses the PFI-001 or variant, homologue, fragment, analogue or 
derivative thereof may be used to screen for antibodies, peptides, or other agents, such as organic or inorganic molecules, 
that act as modulators (e.g. antagonists or agonists) of PFI-001 activity or for the expression thereof, thereby identifying 
a therapeutic agent capable of modulating the receptor. Alternatively, screening of peptide libraries or organic libraries 
made by combinatorial chemistry with recombinantly expressed PFI-001 or a variant, homologue, fragment, analogue 
or derivative thereof or cell lines expressing PFI-001 or a variant, homologue, fragment, analogue or derivative thereof 
may be useful for identification of therapeutic agents that function by modulating the receptor. Synthetic compounds, 
natural products, and other sources of potentially biologically active materials can be screened in a number of ways 
deemed to be routine to those of skill in the art. For example, nucleotide sequences encoding the N-terminal region of 
PFI-001 may be expressed in a cell line, which can be used for screening of allosteric modulators, either agonists or 
antagonists of PFI-001 activity. 

[0297] A PFI-001 polypeptide, its immunogenic fragments or oligopeptides thereof can be used for screening 
therapeutic compounds in any of a variety of drug screening techniques. The polypeptide employed in such a test may be 
free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of binding 
complexes between a PFI-001 polypeptide and the agent being tested may be measured. 

[0298] Accordingly, the present invention relates to a method for screening one or a plurality of compounds for 
modulation (preferably specific modulation, such as specific binding affinity) of PFI-001 or the expression thereof, or a 
portion thereof or variant, homologue, fragment, analogue or derivative thereof, comprising providing one or a plurality 
of compounds; combining a PFI-001 or a nucleotide sequence coding for the same or a portion thereof or variant, 
homologue, fragment, analogue or derivative thereof with the or each of a plurality of compounds for a time sufficient 
to allow modulation under suitable conditions; and detecting binding of a PFI-001, or portion thereof or variant, 
homologue, fragment, analogue or derivative thereof, to each of the plurality of compounds, thereby identifying the compound 
or compounds which modulate a PFI-001 or a nucleotide sequence coding for the same. In such an assay, the plurality 
of compounds may be produced by combinatorial chemistry techniques known to those of skill in the art. 

[0299] Another technique for drug screening provides for high throughput screening (HTS) of compounds having 
suitable binding affinity to the PFI-001 polypeptides and is based upon the method described in detail in Geysen, WO 
84/03564, published on September 13, 1984. In summary, large numbers of different small peptide test compounds 
are synthesized on a solid substrate, such as plastic pins or some other surface. The peptide test compounds are 
reacted with PFI-001 fragments and washed. A bound PFI-001 is then detected - such as by appropriately adapting 
methods well known in the art. A purified PFI-001 can also be coated directly onto plates for use in the aforementioned 
drug screening techniques. Alternatively, non-neutralising antibodies can be used to capture the peptide and immobilise 
it on a solid support. 

[0300] This invention also contemplates the use of competitive drug screening assays in which neutralising antibodies capable of binding a PFI-001 polypeptide specifically compete with a test compound for binding a PFI-001. In this manner, the antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with a PFI-001.

[0301] The assay method of the present invention may be a high throughput screen (HTS). In this regard, the teach- 
ings of WO 84/03564 may be adapted for the PFI-001 of the present invention. 

[0302] The teachings of US-A-5738985 may also be adapted for the assay method of the present invention.


AGENTS 

[0303] The present invention also provides one or more agents identified by the assay methods and identification methods of the present invention.

[0304] The agent of the present invention can be, for example, an organic compound or an inorganic compound. The agent can be, for example, a nucleotide sequence that is antisense to all or part of the sequence shown in SEQ ID NO: 1.

[0305] The invention further provides an agent of the present invention (or even a pharmaceutically acceptable salt thereof, or a pharmaceutically acceptable solvate thereof) or a pharmaceutical composition containing any of the foregoing, for use as a medicament.

[0306] The present invention also relates to the use of an agent to affect PFI-001 activity (such as to antagonise, 
modulate or agonise its GPCR activity). 

DIAGNOSTICS 


[0307] The present invention also provides a diagnostic composition for the detection of PFI-001 polynucleotide 
sequences. The diagnostic composition may comprise the sequence shown in SEQ ID NO: 1 or a variant, homologue,
fragment, analogue or derivative thereof, or a sequence capable of hybridising to all or part of the nucleotide sequence 
shown in SEQ ID NO: 1 or an allelic variation thereof. 

[0308] In order to provide a basis for the diagnosis of disease, normal or standard values from a PFI-001 polypeptide expression should be established. This is accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with antibody to a PFI-001 polypeptide under conditions suitable for complex formation which are well known in the art. The amount of standard complex formation may be quantified by comparing it to a dilution series of positive controls where a known amount of antibody is combined with known concentrations of a purified PFI-001 polypeptide. Then, standard values obtained from normal samples may be compared with values obtained from samples from subjects potentially affected by a disorder or disease related to a PFI-001 polypeptide expression. Deviation between standard and subject values establishes the presence of the disease state.

[0309] A PFI-001 polynucleotide, or any part thereof, may provide the basis for a diagnostic and/or a therapeutic compound. For diagnostic purposes, PFI-001 polynucleotide sequences may be used to detect and quantify gene expression in conditions, disorders or diseases in which PFI-001 activity may be implicated.

[0310] A PFI-001-encoding polynucleotide sequence may be used for the diagnosis of diseases resulting from expression of PFI-001. For example, polynucleotide sequences encoding PFI-001 may be used in hybridisation or PCR assays of tissues from biopsies or autopsies or biological fluids, such as serum, synovial fluid or tumour biopsy, to detect abnormalities in PFI-001 expression. The form of such qualitative or quantitative methods may include Southern or northern analysis, dot blot or other membrane-based technologies; PCR technologies; dip stick, pin or chip technologies; and ELISA or other multiple sample format technologies. All of these techniques are well known in the art and are, in fact, the basis of many commercially available diagnostic kits.

[0311] Such assays may be tailored to evaluate the efficacy of a particular therapeutic treatment regime and may be used in animal studies, in clinical trials, or in monitoring the treatment of an individual patient. In order to provide a basis for the diagnosis of disease, a normal or standard profile for PFI-001 expression should be established. This is accomplished by combining body fluids or cell extracts taken from normal subjects, either animal or human, with PFI-001 or a portion thereof, under conditions suitable for hybridisation or amplification. Standard hybridisation may be quantified by comparing the values obtained for normal subjects with a dilution series of positive controls run in the same experiment where a known amount of purified PFI-001 is used. Standard values obtained from normal samples may be compared with values obtained from samples from subjects potentially affected by a disorder or disease related to expression of the PFI-001 coding sequence. Deviation between standard and subject values establishes the presence of the disease state. If disease is established, an existing therapeutic agent is administered, and treatment profile or values may be generated. Finally, the assay may be repeated on a regular basis to evaluate whether the values progress toward or return to the normal or standard pattern. Successive treatment profiles may be used to show the efficacy of treatment over a period of several days or several months.
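
By way of illustration only, the quantification against a dilution series and the comparison of subject values with normal values may be sketched as follows. The signal values, the known amounts, the use of linear interpolation and the two-standard-deviation cut-off are all assumptions made for this example; none of them is taken from the present description.

```python
from statistics import mean, stdev

def interpolate(signal, curve):
    """Estimate an amount from a signal by linear interpolation on a standard curve.

    curve: list of (known_amount, measured_signal) pairs from the dilution series.
    """
    pts = sorted(curve, key=lambda p: p[1])
    if not pts[0][1] <= signal <= pts[-1][1]:
        raise ValueError("signal lies outside the standard curve")
    for (a0, s0), (a1, s1) in zip(pts, pts[1:]):
        if s0 <= signal <= s1:
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)

def deviates(subject_amount, normal_amounts, n_sd=2.0):
    """Flag a subject value lying more than n_sd standard deviations from the normal mean."""
    m, sd = mean(normal_amounts), stdev(normal_amounts)
    return abs(subject_amount - m) > n_sd * sd

# Hypothetical dilution series: (known amount of purified PFI-001, measured signal)
standard_curve = [(0.0, 0.0), (1.0, 10.0), (2.0, 20.0), (4.0, 40.0)]
normal_signals = [9.0, 10.0, 11.0, 10.0]   # hypothetical normal subjects
subject_signal = 30.0                      # hypothetical test subject

normal_amounts = [interpolate(s, standard_curve) for s in normal_signals]
subject_amount = interpolate(subject_signal, standard_curve)
print(subject_amount, deviates(subject_amount, normal_amounts))
```

In practice the form of the standard curve and the criterion for a significant deviation would be chosen to suit the particular assay format.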

[0312] Thus, the present invention relates to the use of a PFI-001 polypeptide, or variant, homologue, fragment, 
analogue or derivative thereof, to produce anti-PFI-001 antibodies which can, for example, be used diagnostically to
detect and quantify PFI-001 levels in disease states.

[0313] The present invention further relates to diagnostic assays and kits for the detection of PFI-001 in cells and 
tissues comprising a purified PFI-001 which may be used as a positive control, and anti-PFI-001 antibodies. Such 
antibodies may be used in solution-based, membrane-based, or tissue-based technologies to detect any disease state 
or condition related to the expression of PFI-001 protein or expression of deletions or a variant, homologue, fragment,
analogue or derivative thereof.

PROBES 

[0314] Another aspect of the subject invention is the provision of nucleic acid hybridisation or PCR probes which are capable of detecting polynucleotide sequences, including genomic sequences, encoding PFI-001 coding region or closely related molecules, such as alleles. The specificity of the probe, i.e. whether it is derived from a highly conserved, conserved or non-conserved region or domain, and the stringency of the hybridisation or amplification (high, intermediate or low) will determine whether the probe identifies only naturally occurring PFI-001 coding sequence, or related sequences. Probes for the detection of related nucleic acid sequences are selected from conserved or highly conserved nucleotide regions of PFI-001 polynucleotides, such as the 3' region, and such probes may be used in a pool of degenerate probes. For the detection of identical nucleic acid sequences, or where maximum specificity is desired, nucleic acid probes are selected from the non-conserved nucleotide regions or unique regions of PFI-001 polynucleotides. As used herein, the term "non-conserved nucleotide region" refers to a nucleotide region that is unique to the PFI-001 coding sequence disclosed herein and does not occur in related sequences.
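
By way of illustration only, the choice between conserved and non-conserved regions may be sketched as a sliding-window comparison of a target sequence with one related sequence. The two toy sequences, the window size and the identity thresholds below are hypothetical and are not PFI-001 data; an ungapped pairwise alignment is assumed for simplicity.

```python
def window_identity(target, related, size):
    """Yield (start, fraction_identical) for each window of an ungapped pairwise alignment."""
    if len(target) != len(related):
        raise ValueError("sequences must be aligned to the same length")
    for start in range(len(target) - size + 1):
        a = target[start:start + size]
        b = related[start:start + size]
        same = sum(x == y for x, y in zip(a, b))
        yield start, same / size

def pick_windows(target, related, size, min_identity=0.0, max_identity=1.0):
    """Return start positions of windows whose identity lies within the given bounds."""
    return [start for start, ident in window_identity(target, related, size)
            if min_identity <= ident <= max_identity]

# Hypothetical aligned toy sequences: identical over the first 12 bases, different afterwards
target = "ATGGTGGCAGCT" + "AAAAAAAA"
related = "ATGGTGGCAGCT" + "CCCCCCCC"

# Conserved windows: candidate probes for detecting related sequences
print(pick_windows(target, related, size=8, min_identity=0.9))
# Non-conserved windows: candidate probes where maximum specificity is desired
print(pick_windows(target, related, size=8, max_identity=0.1))
```

A practical selection would compare the target against all known related sequences and would also take the intended hybridisation stringency into account.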

[0315] PCR, as described in US-A-4683195, US-A-4800195 and US-A-4965188, provides additional uses for oligonucleotides based upon the PFI-001 sequence. Such oligomers are generally chemically synthesized, but they may be generated enzymatically or produced from a recombinant source. Oligomers generally comprise two nucleotide sequences, one with sense orientation (5'→3') and one with antisense (3'←5'), employed under optimised conditions for identification of a specific gene or condition. The same two oligomers, nested sets of oligomers, or even a degenerate pool of oligomers may be employed under less stringent conditions for detection and/or quantification of closely related DNA or RNA sequences.

[0316] The nucleic acid sequence for PFI-001 can also be used to generate hybridisation probes as previously described, for mapping the endogenous genomic sequence. The sequence may be mapped to a particular chromosome or to a specific region of the chromosome using well known techniques. These include in situ hybridisation to chromosomal spreads (Verma et al. (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York City, USA), flow-sorted chromosomal preparations, or artificial chromosome constructions such as yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions or single chromosome cDNA libraries.

[0317] In situ hybridisation of chromosomal preparations and physical mapping techniques such as linkage analysis using established chromosomal markers are invaluable in extending genetic maps. Examples of genetic maps can be found in Science (1995; 270:410f and 1994; 265:1981f). Often the placement of a gene on the chromosome of another mammalian species may reveal associated markers even if the number or arm of a particular human chromosome is not known. New sequences can be assigned to chromosomal arms, or parts thereof, by physical mapping. This provides valuable information to investigators searching for disease genes using positional cloning or other gene discovery techniques. Once a disease or syndrome, such as ataxia telangiectasia (AT), has been crudely localised by genetic linkage to a particular genomic region, for example, AT to 11q22-23 (Gatti et al. (1988) Nature 336:577-580), any sequences mapping to that area may represent associated or regulatory genes for further investigation. The nucleotide sequence of the subject invention may also be used to detect differences in the chromosomal location due to translocation, inversion, etc. between normal, carrier or affected individuals.

PHARMACEUTICALS 

[0318] The present invention also provides a pharmaceutical composition for treating an individual in need of the same due to PFI-001 activity, the composition comprising a therapeutically effective amount of an agent that modulates (such as antagonises or agonises) said activity and a pharmaceutically acceptable carrier, diluent, excipient or adjuvant.

[0319] Thus, the present invention also covers pharmaceutical compositions comprising the agents of the present invention (an agent capable of modulating the expression pattern of the nucleotide sequence of the present invention or the activity of the expression product thereof and/or an agent identified by an assay according to the present invention). In this regard, and in particular for human therapy, even though the agents of the present invention can be administered alone, they will generally be administered in admixture with a pharmaceutical carrier, adjuvant, excipient or diluent selected with regard to the intended route of administration and standard pharmaceutical practice.

[0320] By way of example, in the pharmaceutical compositions of the present invention, the agents of the present invention may be admixed with any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s), or solubilising agent(s).

[0321] In general, a therapeutically effective daily oral or intravenous dose of the agents of the present invention is likely to range from 0.01 to 50 mg/kg body weight of the subject to be treated, preferably 0.1 to 20 mg/kg. The agents of the present invention may also be administered by intravenous infusion, at a dose which is likely to range from 0.001-10 mg/kg/hr.
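
By way of arithmetic illustration only, the per-kilogram ranges given above may be converted into absolute amounts for a subject of a given body weight as sketched below. The 70 kg body weight is a hypothetical value chosen for the example and forms no part of the present description.

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg, high_mg_per_kg):
    """Convert a per-kilogram dose range into an absolute range in mg."""
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)

BODY_WEIGHT_KG = 70.0  # hypothetical subject weight, chosen only for illustration

broad = dose_range_mg(BODY_WEIGHT_KG, 0.01, 50.0)      # 0.01 to 50 mg/kg per day
preferred = dose_range_mg(BODY_WEIGHT_KG, 0.1, 20.0)   # 0.1 to 20 mg/kg per day
infusion = dose_range_mg(BODY_WEIGHT_KG, 0.001, 10.0)  # 0.001 to 10 mg/kg/hr

print("daily dose, broad range (mg): %.1f to %.0f" % broad)
print("daily dose, preferred range (mg): %.0f to %.0f" % preferred)
print("infusion rate (mg/hr): %.2f to %.0f" % infusion)
```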

[0322] Thus, the present invention also provides a method of treating an individual in need of the same due to PFI- 
001 activity comprising administering to said individual an effective amount of the pharmaceutical composition of the 
present invention. 

[0323] Typically, the physician will determine the actual dosage which will be most suitable for an individual patient and it will vary with the age, weight, sex and response of the particular patient. The above dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within the scope of this invention.

[0324] Where appropriate, the pharmaceutical compositions can be administered by inhalation, in the form of a suppository or pessary, topically in the form of a lotion, solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form of tablets containing excipients such as starch or lactose, or in capsules or ovules either alone or in admixture with excipients, or in the form of elixirs, solutions or suspensions containing flavouring or colouring agents, or they can be injected parenterally, for example intracavernosally, intravenously, intramuscularly or subcutaneously. For parenteral administration, the compositions may be best used in the form of a sterile aqueous solution which may contain other substances, for example enough salts or monosaccharides to make the solution isotonic with blood. For buccal or sublingual administration the compositions may be administered in the form of tablets or lozenges which can be formulated in a conventional manner.

[0325] For oral, parenteral, buccal and sublingual administration to subjects (such as patients), the daily dosage level of the agents of the present invention may typically be from 10 to 500 mg (in single or divided doses). Thus, and by way of example, tablets or capsules may contain from 5 to 100 mg of active agent for administration singly, or two or more at a time, as appropriate. It is also possible to administer the agents of the present invention in sustained release formulations.

[0326] In some applications, generally in humans, oral administration of the agents of the present invention is the preferred route, being the most convenient and can in some cases avoid disadvantages associated with other routes of administration - such as those associated with intracavernosal (i.c.) administration. In circumstances where the recipient suffers from a swallowing disorder or from impairment of drug absorption after oral administration, the drug may be administered parenterally, sublingually or buccally.

[0327] For veterinary use, the agent of the present invention is typically administered as a suitably acceptable formulation in accordance with normal veterinary practice and the veterinary surgeon will determine the dosing regimen and route of administration which will be most appropriate for a particular animal. However, as with human treatments, it may be possible to administer the agent alone for veterinary treatments.

[0328] Typically, the pharmaceutical compositions - which may be for human or animal usage - will comprise any one or more of a pharmaceutically acceptable diluent, carrier, excipient or adjuvant. The choice of pharmaceutical carrier, excipient, adjuvant or diluent can be selected with regard to the intended route of administration and standard pharmaceutical practice. As indicated above, the pharmaceutical compositions may comprise as - or in addition to - the carrier, excipient, adjuvant or diluent any suitable binder(s), lubricant(s), suspending agent(s), coating agent(s) or solubilising agent(s).

[0329] In some embodiments of the present invention, the pharmaceutical compositions will comprise one or more of an agent that has been screened by an assay of the present invention; an agent that is capable of interacting with SEQ ID NO: 1 or SEQ ID NO: 2 including derivatives, fragments, homologues, analogues or variants thereof or sequences capable of hybridising to the nucleotide sequence shown in SEQ ID NO: 1.

[0330] Included in the scope of the invention are oligonucleotide sequences, antisense RNA and DNA molecules and ribozymes, which function to destabilise PFI-001 mRNA or inhibit translation of PFI-001.

[0331] A PFI-001 antisense molecule may provide the basis for treatment of various abnormal conditions related to, for example, increased PFI-001 activity.

[0332] Expression vectors derived from retroviruses, adenovirus, herpes or vaccinia viruses, or from various bacterial plasmids, may be used for delivery of recombinant PFI-001 sense or antisense molecules to the targeted cell population. Methods which are well known to those skilled in the art can be used to construct recombinant vectors containing PFI-001. Alternatively, recombinant PFI-001 can be delivered to target cells in liposomes.

[0333] The full length cDNA sequence and/or its regulatory elements enable researchers to use PFI-001 as a tool in sense (Youssoufian H and HF Lodish (1993) Mol Cell Biol 13:98-104) or antisense (Eguchi et al. (1991) Annu Rev Biochem 60:631-652) investigations of gene function. Oligonucleotides, designed from the cDNA or control sequences obtained from the genomic DNA, can be used in vitro or in vivo to inhibit expression. Such technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments can be designed from various locations along the coding or control regions. Appropriate oligonucleotides, which can be 20 nucleotides in length, may be used to isolate PFI-001 sequences or closely related molecules from human libraries.

[0334] Additionally, PFI-001 expression can be modulated by transfecting a cell or tissue with expression vectors which express high levels of a PFI-001 fragment in conditions where it would be preferable to block PFI-001 activity. Such constructs can flood cells with untranslatable sense or antisense sequences. Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules until all copies of the vector are disabled by endogenous nucleases. Such transient expression may last for a month or more with a non-replicating vector and even longer if appropriate replication elements are part of the vector system.

[0335] Modifications of gene expression can be obtained by designing antisense sequences to the control regions of the PFI-001 gene, such as the promoters, enhancers, and introns.

[0336] Oligonucleotides derived from the transcription initiation site, e.g. between -10 and +10 regions of the leader sequence, are preferred. Antisense RNA and DNA molecules may also be designed to block translation of mRNA by preventing the transcript from binding to ribosomes. Similarly, inhibition can be achieved using Hoogsteen base-pairing methodology, also known as "triple helix" base-pairing. Triple helix pairing compromises the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules.

[0337] Thus the invention provides a pharmaceutical composition comprising an agent of the present invention (or 
even a pharmaceutically acceptable salt thereof, or a pharmaceutically acceptable solvate thereof) together with a 
pharmaceutically acceptable diluent, adjuvant, excipient or carrier.

[0338] The pharmaceutical composition could be for veterinary (i.e. animal) usage or for human usage.

[0339] Thus, the present invention also relates to pharmaceutical compositions comprising effective amounts of modulators (e.g. antagonists or agonists) of PFI-001 protein (including antisense nucleic acid sequences) in admixture with a pharmaceutically acceptable diluent, carrier, excipient or adjuvant (including combinations thereof).

[0340] The present invention relates to pharmaceutical compositions which may comprise all or portions of PFI-001 polynucleotide sequences, PFI-001 antisense molecules, PFI-001 polypeptides, protein, peptide or organic modulators of PFI-001 bioactivity, such as antagonists (including antibodies) or agonists, alone or in combination with at least one other agent, such as stabilising compound, and may be administered in any sterile, biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline, dextrose, and water.

GENERAL METHODOLOGY REFERENCES 

[0341] Although in general the techniques mentioned herein are well known in the art, reference may be made in particular to Sambrook et al., Molecular Cloning, A Laboratory Manual (1989) and Ausubel et al., Short Protocols in Molecular Biology (1999) 4th Ed, John Wiley & Sons, Inc. PCR is described in US-A-4683195, US-A-4800195 and US-A-4965188.

DEPOSITS

[0342] The following sample was deposited in accordance with the Budapest Treaty at the recognised depositary The National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St. Machar Drive, Aberdeen, Scotland, AB2 1RY, United Kingdom on 23 August 2000:
NCIMB number NCIMB 41065 is Escherichia coli Pfi-001.

[0343] The depositor was Pfizer Central Research, Pfizer Limited, Ramsgate Road, Sandwich, Kent, CT13 9NJ, United Kingdom.

[0344] One skilled in the art could readily grow the above-mentioned E. coli clone (NCIMB 41065) in Luria Broth containing ampicillin and isolate the plasmid DNA of the clone using the alkali lysis method as described in Sambrook, et al., eds. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York, NY, USA. The dideoxy termination method as described by Sanger et al. (Proceedings of the National Academy of Sciences (USA) (Dec. 1977), 74(12):5463-5467) and modified by Applied Biosystems (see Applied Biosystems manufacturer's literature) for fluorescent detection could then be used to sequence the DNA and identify PFI-001.

[0345] The present invention also encompasses sequences derivable and/or expressable from that deposit and embodiments comprising the same. The present invention also encompasses partial sequences derivable and/or expressable from that deposit and embodiments comprising the same, wherein those partial sequences code for active polypeptides. The present invention also encompasses proteins comprising sequences derivable and/or expressable from that deposit and embodiments comprising the same. The present invention also encompasses proteins comprising partial sequences derivable and/or expressable from that deposit and embodiments comprising the same, wherein those partial sequences code for active polypeptides.

INTRODUCTION TO THE EXAMPLES SECTION, THE FIGURES AND THE SEQUENCE LISTING

[0346] The present invention will now be described, by way of example only, with reference to the accompanying 
Figures and Sequence Listing in which:- 

[0347] Figure 1 shows a schema for the bioinformatic analysis of PFI-001 (db = database). 
[0348] Figure 2 shows a ClustalW alignment of PFI-001 with SW|O88854|GALS_MOUSE GALANIN RECEPTOR
TYPE 2 (GAL2-R) (GALR2).

[0349] SEQ ID NO: 1 shows the nucleotide sequence coding for PFI-001. The ATG translation initiation codon is indicated by the first three non-underlined letters. The stop codon is indicated by the last three non-underlined letters. The underlined sequence represents untranslated sequence. N.B. The fourth base of the nucleotide sequence of PFI-001 was changed from a cytosine (C) in the native sequence to a guanine (G) in the cloned sequence. Thus, the isolated native nucleotide sequence of PFI-001 begins "ATGCTG...", whereas the cloned nucleotide sequence of PFI-001 begins "ATGGTG...". This base change was carried out in order to generate an optimised "Kozak" sequence in order to permit high level expression of PFI-001 in mammalian cells. This base substitution resulted in a change of the amino acid sequence (see SEQ ID NO: 2 below).

[0350] SEQ ID NO: 2 shows the corresponding amino acid sequence coding for PFI-001. In view of the manipulation of the native nucleotide PFI-001 sequence as described above, the second amino acid residue was changed from a leucine (L) in the native sequence to a valine (V) in the cloned sequence. Thus, the native amino acid sequence of PFI-001 begins "MLA...", whereas the amino acid sequence expressed by NCIMB 41065 begins "MVA...".
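
By way of illustration only, the effect of the single base change on the encoded protein may be checked with the short sketch below. The native nine-base start "ATGCTGGCA" is inferred from the statements that only the fourth base was changed and that the cloned sequence begins "ATGGTGGCA"; the codon table is deliberately restricted to the four codons needed here.

```python
# Only the codons needed for this example; a full codon table is not reproduced here.
CODONS = {"ATG": "M", "CTG": "L", "GTG": "V", "GCA": "A"}

def translate(dna):
    """Translate a DNA string codon by codon, ignoring any incomplete trailing codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

native = "ATGCTGGCA"   # native start, inferred as described in the lead-in
cloned = "ATGGTGGCA"   # first nine bases of SEQ ID NO: 1

# 1-based positions at which the two sequences differ: (position, native base, cloned base)
diffs = [(i + 1, n, c) for i, (n, c) in enumerate(zip(native, cloned)) if n != c]

print(diffs)
print(translate(native), translate(cloned))
```

The only difference is at the fourth base, which places a G immediately after the ATG, and the second residue changes from leucine to valine accordingly.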

[0351] SEQ ID NOS: 3 and 4 show the PCR primers used in the Examples.

[0352] The polynucleotide which encodes the GPCR of the present invention was cloned and the DNA and amino acid sequences analysed using various bioinformatic tools. The GPCR encoded by the sequences described herein has been termed PFI-001.

Examples 


IDENTIFICATION OF PFI-001 

[0353] PFI-001 was identified in unannotated genomic sequence from chromosome 5 (GenBank contig AC011375) which was released by the Genome Sequencing Centers by searching the sequences with known members of the G-protein coupled receptor (GPCR) family using the BLAST algorithm.

BIOINFORMATIC STUDIES

[0354] In order to confirm that PFI-001 was a member of the GPCR family, a number of bioinformatics approaches were performed.

(a) BLAST Search against Swissprot

PFI-001 was searched against Swissprot using the BLAST algorithm (Basic Local Alignment Search Tool (Altschul SF (1993) J. Mol. Evol. 36:290-300; Altschul, SF et al. (1990) J. Mol. Biol. 215:403-410)) to identify the closest protein match. In this case the top hit was to:
SW|O88854|GALS_MOUSE GALANIN RECEPTOR TYPE 2 (GAL2-R) (GALR2).

These results indicate that PFI-001 is a member of the GPCR family.

(b) ClustalW Alignment of PFI-001 with SW|O88854|GALS_MOUSE GALANIN RECEPTOR TYPE 2 (GAL2-R) (GALR2)

These results are shown in Figure 2.

(c) BLAST search against a non-redundant human GPCR database

PFI-001 was searched against a non-redundant human GPCR database comprising mainly sequences from Genbank and Geneseq Patents databases in order to identify the class of agonist for this receptor. The top ten hits are shown below:





O43603 GALR2    : e-value = 3e-23, %Identity = 25%
P42711 GALR     : e-value = 2e-21, %Identity = 26%
O60755 GALANIN2 : e-value = 5e-21, %Identity = 25%
M84605 NK4R     : e-value = 1e-17, %Identity = 23%
P32745 SSR3     : e-value = 2e-17, %Identity = 24%
P29371 NK3R     : e-value = 5e-17, %Identity = 23%
P30872 SSR1     : e-value = 3e-16, %Identity = 23%
P50052 AT2      : e-value = 6e-16, %Identity = 25%
P50391 NY4R     : e-value = 8e-16, %Identity = 24%
N76675 NK1R     : e-value = 1e-15, %Identity = 23%



(e value = statistical likelihood of the hit occurring by chance) 

These results demonstrate that PFI-001 is most closely similar to galanin receptors and they suggest that PFI-001 encodes a novel GPCR whose ligand is likely to be a peptide.
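
By way of illustration only, the reasoning applied to the ten hits listed above may be sketched as a grouping of the hits by receptor family followed by a ranking on the best e-value in each family. The hit names, e-values and percentage identities are as read from the list above; the family labels assigned to each name prefix are assumptions made for this example.

```python
from collections import defaultdict

# (name, e-value, % identity) as read from the list of top ten hits
hits = [
    ("GALR2", 3e-23, 25), ("GALR", 2e-21, 26), ("GALANIN2", 5e-21, 25),
    ("NK4R", 1e-17, 23), ("SSR3", 2e-17, 24), ("NK3R", 5e-17, 23),
    ("SSR1", 3e-16, 23), ("AT2", 6e-16, 25), ("NY4R", 8e-16, 24),
    ("NK1R", 1e-15, 23),
]

# Assumed family labels keyed by name prefix (illustrative only)
FAMILIES = {"GAL": "galanin", "NK": "neurokinin", "SSR": "somatostatin",
            "AT": "angiotensin", "NY": "neuropeptide Y"}

def family(name):
    """Return the assumed receptor family for a hit name, or 'unknown'."""
    for prefix, label in FAMILIES.items():
        if name.startswith(prefix):
            return label
    return "unknown"

# Keep the smallest (most significant) e-value seen for each family
best = defaultdict(lambda: float("inf"))
for name, evalue, _identity in hits:
    best[family(name)] = min(best[family(name)], evalue)

ranking = sorted(best.items(), key=lambda item: item[1])
for label, evalue in ranking:
    print(f"{label:15s} best e-value {evalue:.0e}")
```

Under these assumed labels every family in the ranking is a peptide receptor family, with the galanin receptors ranked first.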

(d) Profile hidden Markov model

HIDDEN MARKOV MODEL RESULTS:

Top Model: galanin
2nd Model: leukotriene
3rd Model: neuropeptide Y
4th Model: releasing hormone
5th Model: angiotensin
6th Model: orexin2
E Values: 1 2U6-09 0 021 i W 4 5 7

TOP BLAST HIT:

Name of top hit: Galanin receptor type 2 (GAL2-R)
E Value: 3.00E-22 (25% identity)

Thus, HMM analysis suggests that the ligand of PFI-001 is most likely to be galanin.

ISOLATION OF PFI-001

[0355] Full-length coding sequence of PFI-001 was cloned from human genomic DNA (Promega) using PCR.

[0356] PCR reaction: PCR reaction was set up as follows: dNTPs (10 mM) - 1 µl, Forward Primer (10 µM) - 1 µl, Reverse Primer (10 µM) - 1 µl, 5x reaction buffer - 10 µl, Elongase (Life Technologies, Inc.) - 1 µl, Genomic DNA - 1 µg; made up to 50 µl with water.
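
By way of arithmetic illustration only, the final concentration of each component in the 50 µl reaction follows from the relation C1 x V1 = C2 x V2, as sketched below. The volumes are taken as microlitres; the dictionary layout and the function name are assumptions made for this example.

```python
FINAL_VOLUME_UL = 50.0

# component: (stock concentration, unit, volume added in µl)
components = {
    "dNTPs": (10.0, "mM", 1.0),
    "forward primer": (10.0, "µM", 1.0),
    "reverse primer": (10.0, "µM", 1.0),
    "reaction buffer": (5.0, "x", 10.0),
}

def final_concentration(stock, volume_ul, total_ul=FINAL_VOLUME_UL):
    """Solve C1 * V1 = C2 * V2 for the final concentration C2."""
    return stock * volume_ul / total_ul

for name, (stock, unit, vol) in components.items():
    print(f"{name}: {final_concentration(stock, vol):g} {unit}")

# Volume already accounted for: the listed components plus 1 µl of enzyme.
# The template volume is not stated in the text and is therefore left out.
added = sum(vol for _stock, _unit, vol in components.values()) + 1.0
print(f"water: up to {FINAL_VOLUME_UL - added:g} µl, less the template volume")
```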

PCR primers:

[0357] Forward Primer + Kozak sequence around ATG (= Pfi01-1 (forward)):
5'- CAACCATGGTGGCAGCTGCCTTTGC -3' (SEQ ID NO: 3)

[0358] Reverse Primer (= Pfi01-2 (reverse)):
5'- GGTTCTCTACTTACCACATTGC -3' (SEQ ID NO: 4)
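
By way of illustration only, the positions of the two primers relative to SEQ ID NO: 1 may be checked with the sketch below. The two excerpts are short stretches copied from SEQ ID NO: 1 (the start of the coding sequence and a stretch downstream of the stop codon); the variable and function names are assumptions made for this example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]

forward_primer = "CAACCATGGTGGCAGCTGCCTTTGC"   # SEQ ID NO: 3
reverse_primer = "GGTTCTCTACTTACCACATTGC"      # SEQ ID NO: 4

# Short excerpts copied from SEQ ID NO: 1
five_prime_excerpt = "ATGGTGGCAGCTGCCTTTGCAGACTCTA"
three_prime_excerpt = "CCAAATGTGCAATGTGGTAAGTAGAGAACCACCCAGC"

# The forward primer carries a short 5' extension ahead of the ATG
atg = forward_primer.index("ATG")
print(forward_primer[:atg], forward_primer[atg:])

# The remainder of the forward primer matches the start of the coding sequence
print(five_prime_excerpt.startswith(forward_primer[atg:]))

# The reverse primer anneals to the opposite strand, so its reverse complement
# should appear in the sequence as listed
print(reverse_complement(reverse_primer) in three_prime_excerpt)
```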






[0359] PCR conditions: 94°C - 3 minutes. Then 30 cycles of 94°C - 1 minute, 58°C - 1 minute, 72°C - 2 minutes.
Final cycle = 72°C - 10 minutes.
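
By way of arithmetic illustration only, the programmed duration of the above cycling conditions may be totalled as sketched below. The step labels (denaturation, annealing, extension) are the conventional interpretation of the three temperatures and are an assumption; ramp times between temperatures are not included.

```python
INITIAL_DENATURATION_MIN = 3
CYCLES = 30
# Minutes per step within each cycle; the labels are conventional, assumed names
CYCLE_STEPS_MIN = {"94°C denaturation": 1, "58°C annealing": 1, "72°C extension": 2}
FINAL_EXTENSION_MIN = 10

per_cycle = sum(CYCLE_STEPS_MIN.values())
total = INITIAL_DENATURATION_MIN + CYCLES * per_cycle + FINAL_EXTENSION_MIN
print(f"{per_cycle} minutes per cycle, {total} minutes in total")
```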

[0360] The PFI-001 PCR product was gel extracted using the QIAgen gel extraction kit, according to the manufacturer's instructions. The resultant product was TA cloned (Invitrogen TA cloning methodology) into the vector pcDNA3.1/V5-His-TOPO (Invitrogen), according to the manufacturer's instructions.

[0361] It will be appreciated that the foregoing is provided by way of example only and modification of detail may be made without departing from the scope of the invention.

SEQUENCE LISTING
(which forms part of the description) 



SEQ ID NO: 1 



Nucleotide sequence coding for PFI-001 



ATGGTGGCAGCTGCCTTTGCAGACTCTAACTCCAGCAGCATGAATGTGTCCTTTGCT

CACCTCCACTTTGCCGGAGGGTACCTGCCCTCTGATTCCCAGGACTGGAGAACCAT 

CATCCCGGCTCTCTTGGTGGCTGTCTGCCTGGTGGGCTTCGTGGGAAACCTGTGTGT 

GATTGGCATCCTCCTTCACAATGCTTGGAAAGGAAAGCCATCCATGATCCACTCCC 

TGATTCTGAATCTCAGCCTGGCTGATCTCTCCCTCCTGCTGTTTTCTGCACCTATCCG 

AGCTACGGCGTACTCCAAAAGTGTTTGGGATCTAGGCTGGTTTGTCTGCAAGTCCT 

CTGACTGGTTTATCCACACATGCATGGCAGCCAAGAGCCTGACAATCGTTGTGGTG 

GCCAAAGTATGCTTCATGTATGCAAGTGACCCAGCCAAGCAAGTGAGTATCCACAA 

CTACACCATCTGGTCAGTGCTGGTGGCCATCTGGACTGTGGCTAGCCTGTTACCCCT 

GCCGGAATGGTTCTTTAGCACCATCAGGCATCATGAAGGTGTGGAAATGTGCCTCG 

TGGATGTACCAGCTGTGGCTGAAGAGTTTATGTCGATGTTTGGTAAGCTCTACCCA 

CTCCTGGCATTTGGCCTTCCATTATTTTTTGCCAGCTTTTATTTCTGGAGAGCTTATG 

ACCAATGTAAAAAACGAGGAACTAAGACTCAAAATCTTAGAAACCAGATACGCTC 

AAAGCAAGTCACAGTGATGCTGCTGAGCATTGCCATCATCTCTGCTCTCTTGTGGCT 

CCCCGAATGGGTAGCTTGGCTGTGGGTATGGCATCTGAAGGCTGCAGGCCCGGCCC 

CACCACAAGGTTTCATAGCCCTGTCTCAAGTCTTGATGTTTTCCATCTCTTCAGCAA 

ATCCTCTCATTTTTCTTGTGATGTCGGAAGAGTTCAGGGAAGGCTTGAAAGGTGTAT 

GGAAATGGATGATAACCAAAAAACCTCCAACTGTCTCAGAGTCTCAGGAAACACC 

AGCTGGCAACTCAGAGGGTCTTCCTGACAAGGTTCCATCTCCAGAATCCCCAGCAT 

CCATACCAGAAAAAGAGAAACCCAGCTCTCCCTCCTCTGGCAAAGGGAAAACTGA

GAAGGCAGAGATTCCCATCCTTCCTGACGTAGAGCAGTTTTGGCATGAGAGGGACA 

CAGTCCCTTCTGTACAGGACAATGACCCTATCCCCTGGGAACATGAAGATCAAGAG 

ACAGGGGAAGGTGTTAAATAGATTTAAGTTTCAAAGCAAAACAAACTGTGATTATT

GTATTTACTTGTACTGCTGCTTATCAATATTCCTGACTTTACAAACTGATATAATTA

TTACCATTAGGAATTATAAAAATATTTCACAATCTACACTTTCCAAATGTGCAATGT 

GGTAAGTAGAGAACCACCCAGCTTTCTTGTACAAACTAGTCCCCAAGGCC 



SEQ ID NO: 2

Amino acid sequence coding for PFI-001

MVAAAFADSNSSSMNVSFAHLHFAGGYLPSDSQDWRTIIPALLVAVCLVGFVGNLCVI
GILLHNAWKGKPSMIHSLILNLSLADLSLLLFSAPIRATAYSKSVWDLGWFVCKSSDWF
IHTCMAAKSLTIVVVAKVCFMYASDPAKQVSIHNYTIWSVLVAIWTVASLLPLPEWFFS
TIRHHEGVEMCLVDVPAVAEEFMSMFGKLYPLLAFGLPLFFASFYFWRAYDQCKKRG
TKTQNLRNQIRSKQVTVMLLSIAIISALLWLPEWVAWLWVWHLKAAGPAPPQGFIALS
QVLMFSISSANPLIFLVMSEEFREGLKGVWKWMITKKPPTVSESQETPAGNSEGLPDKV
PSPESPASIPEKEKPSSPSSGKGKTEKAEIPILPDVEQFWHERDTVPSVQDNDPIPWEHED
QETGEGVK



SEQ ID NO: 3

CAACCATGGTGGCAGCTGCCTTTGC

SEQ ID NO: 4



GGTTCTCTACTTACCACATTGC 



Claims 



1. 



An isolated and/or purified polynucleotide comprising one or more of 
(a) a polynucleotide encoding the polypeptide as set forth in SEQ ID NO: 2;

(b) a polynucleotide comprising a nucleotide sequence of SEQ ID NO: 1;

(c) a polynucleotide encoding the polypeptide expressed by the DNA contained in NCIMB 41065;

(d) a polynucleotide comprising a nucleotide sequence that has at least 70% identity to the polynucleotide of 
any one of (a) to (c); 

(e) a polynucleotide comprising a nucleotide sequence which is capable of hybridising to the polynucleotide 
of any one of (a) to (d); 

(f) a complement to the polynucleotide of any one of (a) to (e); or 

(g) a polynucleotide fragment of the polynucleotide of any one of (a) to (f). 
2. The polynucleotide of claim 1, comprising a nucleotide sequence that has at least 75% identity to the polynucleotide
of any one of (a) to (c).

3. The polynucleotide of claim 1, comprising a nucleotide sequence that has at least 80% identity to the polynucleotide
of any one of (a) to (c).

4. The polynucleotide of claim 1, comprising a nucleotide sequence that has at least 85% identity to the polynucleotide
of any one of (a) to (c).

5. The polynucleotide of claim 1, comprising a nucleotide sequence that has at least 90% identity to the polynucleotide
of any one of (a) to (c).

6. The polynucleotide of claim 1, comprising a nucleotide sequence that has at least 95% identity to the polynucleotide
of any one of (a) to (c).

7. The polynucleotide of any one of the preceding claims which encodes a G-protein coupled receptor (GPCR).

8. A polynucleotide probe or primer comprising at least 15 contiguous nucleotides of the polynucleotide of any one 
of the preceding claims. 



9. A vector comprising the polynucleotide of any one of the preceding claims. 

10. A host cell transformed or transfected with the vector of claim 9. 

11. The transformed/transfected host cell of claim 10 which is a mammalian, insect, fungal, bacterial or yeast cell.

12. The transcribed RNA product of the polynucleotide of any one of claims 1 to 8.

13. An RNA molecule or a fragment thereof which is antisense in relation to the RNA product of claim 12 and is capable
of hybridising thereto.

14. A ribozyme or zinc finger protein capable of binding to the polynucleotide of any one of claims 1 to 8.

15. A process for producing a polypeptide or fragment thereof comprising culturing the transformed/transfected host
cell of claim 10 or claim 11 under conditions sufficient for the expression of said polypeptide or fragment.

16. The process of claim 15, wherein said polypeptide or fragment is expressed at the surface of said cell. 

17. The process of claim 15 or claim 16 which further includes recovering the polypeptide or fragment from the culture. 


18. A process for producing cells capable of expressing a polypeptide or fragment thereof comprising transforming or 
transfecting cells with the vector of claim 9. 

19. Cells produced by the process of claim 18. 

20. A membrane preparation of the cells of claim 19. 

21. A polypeptide or a fragment thereof produced by the process of any one of claims 15 to 18.

22. A polypeptide comprising:

(a) a polypeptide having the deduced amino acid sequence translated from the polynucleotide sequence in
SEQ ID NO: 1 and variants, fragments, homologues, analogues and derivatives thereof;

(b) a polypeptide of SEQ ID NO: 2 and variants, fragments, homologues, analogues and derivatives thereof; or

(c) a polypeptide encoded by the cDNA of NCIMB 41065 and variants, fragments, homologues, analogues
and derivatives thereof.

23. An antibody against the polypeptide of claim 22. 
24. A compound which modulates the polypeptide of claim 22.

25. A compound according to claim 24 which antagonises or selectively antagonises the polypeptide of claim 22.

26. A compound according to claim 24 which agonises the polypeptide of claim 22.

27. A pharmaceutical composition comprising the antibody of claim 23 or the compound of any one of claims 24 to 26
and one or more pharmaceutically acceptable carriers, diluents, adjuvants or excipients.

28. A method for identifying a compound which binds to and modulates the polypeptide of claim 22 comprising
contacting said polypeptide with a candidate compound and determining whether modulation occurs.

29. A method according to claim 28, which comprises:

(a) contacting a compound with cells expressing the polypeptide of claim 22 on their surface, said polypeptide
being associated with a second component capable of providing a detectable signal in response to the binding
of a compound to said polypeptide; said contacting being under conditions sufficient to permit binding of
compounds to the polypeptide; and

(b) identifying a compound capable of polypeptide binding by detecting the signal produced by said second
component.

30. A method according to claim 29, which comprises: 

(a) contacting (i) a detectable first component known to bind to the polypeptide of claim 22 and (ii) a compound,
with cells expressing the polypeptide of claim 22 on their cell surface, said polypeptide being associated with
a second component capable of providing a detectable signal in response to the binding of a compound to
said polypeptide; said contacting being under conditions sufficient to permit binding of compounds to the
polypeptide; and

(b) determining whether the first component binds to the polypeptide by detecting the absence or otherwise
of a signal generated from the interaction of the first component with the polypeptide.

31. A method according to any one of claims 28 to 30, wherein said compound binds to and antagonises or selectively
antagonises the polypeptide of claim 22.

32. A method according to any one of claims 28 to 30, wherein said compound binds to and agonises the polypeptide
of claim 22. 

33. The antibody of claim 23, the compound of any one of claims 24 to 26, or the composition of claim 27 for use as 
a pharmaceutical. 


34. Use of the compound of any one of claims 24 to 26 in the manufacture of a medicament for the treatment of a 
patient having need to modulate the polypeptide of claim 22. 

35. Use according to claim 34, in the manufacture of a medicament for the treatment of a patient having need to 
antagonise or selectively antagonise the polypeptide of claim 22.

36. Use according to claim 34, in the manufacture of a medicament for the treatment of a patient having need to 
agonise the polypeptide of claim 22. 

37. A method for the treatment of a patient having need to modulate the polypeptide of claim 22 comprising
administering to the patient a therapeutically effective amount of the compound of any one of claims 24 to 26.

38. The method according to claim 37, wherein the method is for the treatment of a patient having need to antagonise
or selectively antagonise the polypeptide of claim 22.


39. The method according to claim 37, wherein the method is for the treatment of a patient having need to agonise 
the polypeptide of claim 22. 
40. The method according to any one of claims 37 to 39, wherein said compound is a polypeptide and a therapeutically
effective amount of the compound is administered by providing to the patient DNA encoding said compound and
expressing said compound in vivo. 

41. Use of the antibody of claim 23 in the manufacture of a medicament for the treatment of a patient having need to 
modulate the polypeptide of claim 22. 

42. Use according to claim 41, in the manufacture of a medicament for the treatment of a patient having need to 
antagonise or selectively antagonise the polypeptide of claim 22. 

43. Use according to claim 41, in the manufacture of a medicament for the treatment of a patient having need to
agonise the polypeptide of claim 22. 

44. A method for the treatment of a patient having need to modulate the polypeptide of claim 22, comprising
administering to the patient a therapeutically effective amount of the antibody of claim 23.

45. The method according to claim 44, wherein the method is for the treatment of a patient having need to antagonise
or selectively antagonise the polypeptide of claim 22.

46. The method according to claim 44, wherein the method is for the treatment of a patient having need to agonise
the polypeptide of claim 22.

47. Cells genetically engineered ex vivo or in vivo to express, overexpress, underexpress or to exhibit targeted insertion
or deletion of the polypeptide of claim 22.
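
Claims 1(d) and 2 to 6 recite sequence identity thresholds from 70% to 95%. As a minimal sketch, assuming identity is counted as matching positions over the columns of a pairwise alignment (the claims in this section do not fix the calculation), the comparison could look as follows. The function name `percent_identity` and the variant sequence are hypothetical and purely illustrative.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must already be aligned (equal length, gaps as '-').
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b and a != gap:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# 20 nucleotides taken from SEQ ID NO: 3 and a made-up variant with two substitutions.
reference = "ATGGTGGCAGCTGCCTTTGC"
variant = "ATGATGGCAGCTGCTTTTGC"

identity = percent_identity(reference, variant)
thresholds = (70, 75, 80, 85, 90, 95)  # values recited in claims 1(d) and 2 to 6
print(identity)                                   # 90.0
print([t for t in thresholds if identity >= t])   # [70, 75, 80, 85, 90]
```

Other conventions exist, for example dividing by the length of the shorter sequence, and they can give different percentages for the same pair.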






Figure 1 



BLAST search against SwissProt -> Identify gene family

ClustalW comparison with SwissProt top hit -> Demonstrates global alignment

BLAST against non-redundant GPCR db -> Demonstrates GPCR sub-family

Analysis of sequence using hidden Markov model







Figure 2 



CLUSTAL W (1.74) multiple sequence alignment

GALS_MOUSE  MNGSDSQGAEDSSQ EGGGGWQPEA VLVPLFFALI FLVGAVGNALVLIAV
temp        MVAAAFADSNSSSMNVSFAHLHFAGGYLPSDSQDWRTIIPALLVAVCLVGFVGNLCVIGI
            * . : .::.** ..**:*. * : . :

GALS_MOUSE  LLRGGQAVSTTNL--FILNLGVADLCFILCCVPFQATIYTLDDWVFGSLLCKAVHFLIFL
temp        LLHNAWKGKPSMIHSLILNLSLADLSLLLFSAPIRATAYSKSVWDLGWFVCKSSDWFIHT
            * * * • * *****.:***.::* * : . * :*

GALS_MOUSE  TMHASSFTLAAVSLDRYIAIRYPMHSRELRTPRNAI^I^
temp        CMAAKSLTIVVVAKVCFMYASDPAKQVSIHN-YTIWSVLVAIWTVASLLPLPEWFFSTIR

GALS_MOUSE  QLANLTVCHPAWSAPRRRAMDLCTFVFS YLLPVLVIjSLTYARTLHY
temp        HHEGVEMCL--VDVP-AVAEEFMS-MFGKLYPLLAFGLPLFFASFYFWRAYDQCKKRGTK
            ; . . * *::;:*.**:*-:-*. :.*:**:* *:

GALS_MOUSE  SQ--R-AKR--KVTRMIVIVAVLFCLCWMPHHALILCVWFGRFPLTRATYALRILSHLVS
temp        TQNLRNQIRSKQVTVMLLSIAIISALLWLPEWVAWLWVWHLKAAGPAPPQGFIALSQVLM
            . * * * .*.. **.* * * *

GALS_MOUSE  YANSCVNPIVYALVSKHFRKGFRKICAGLLRRAPRRASGRVCILAPGNHSGG--MLE-PE
temp        FSISSANPLIFLVMSEEFREGLKGVWKWMITKKPPTVSESQETPA-GNSEGLPDKVPSPE
            :;*..**:::::*:.**:*::: :::* .* * ** . * * *

GALS_MOUSE  S -TDLTQVSEAAGP LVPAPALPNCTTL - - SRTLDPAC
temp        SPASIPEKEKPSSPSSGKGKTEKAEIPILPDVEQFWHERDTVPSVQDNDPIPWEHEDQET

GALS_MOUSE
temp        GEGVK

(temp = native PFI-001 nucleotide sequence)
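
The line of symbols under each block of a Clustal W alignment marks column conservation, with `*` for identical residues. As a minimal sketch, the identity marks for one block can be recomputed directly from the two aligned rows. The rows below are the sixth block of Figure 2 as read from the figure. The function name `identity_line` is hypothetical, and the `:` and `.` symbols are not reproduced because they depend on Clustal W's residue similarity groups.

```python
def identity_line(row_a, row_b, gap="-"):
    """Return a string with '*' where two aligned rows carry the same residue."""
    if len(row_a) != len(row_b):
        raise ValueError("rows of one alignment block must have equal length")
    return "".join(
        "*" if a == b and a != gap else " "
        for a, b in zip(row_a, row_b)
    )


# Sixth 60-column block of Figure 2 (GALS_MOUSE above, temp below).
gals_mouse = "YANSCVNPIVYALVSKHFRKGFRKICAGLLRRAPRRASGRVCILAPGNHSGG--MLE-PE"
temp = "FSISSANPLIFLVMSEEFREGLKGVWKWMITKKPPTVSESQETPA-GNSEGLPDKVPSPE"

marks = identity_line(gals_mouse, temp)
identical = marks.count("*")

print(gals_mouse)
print(temp)
print(marks)
print(identical, "identical columns out of", len(marks))  # 15 identical columns out of 60
```

The count of 15 agrees with the number of `*` symbols printed under that block in the figure.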